AU 1814 

PATENT APPLICATION US/08/083,590A TIME: 11:55:51



PAGE: 1 RAW SEQUENCE LISTING DATE: 02/04/94 




INPUT SET: S1376.raw 



1 SEQUENCE LISTING 
2 

3 (1) GENERAL INFORMATION:
4 

5 (i) APPLICANT: Artavanis-Tsakonas, S. et al.

6 

7 

8 (ii) TITLE OF INVENTION: Therapeutic And Diagnostic Methods 

9 And Compositions Based On Notch Proteins And 
10 Nucleic Acids 

11 

12 (iii) NUMBER OF SEQUENCES: 21 

13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Pennie & Edmonds 

16 (B) STREET: 1155 Avenue of the Americas 

17 (C) CITY: New York 

18 (D) STATE: New York 

19 (E) COUNTRY: U.S.A. 

20 (F) ZIP: 10036 
21 

22 (v) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Floppy disk 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

26 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25
27 

28 (vi) CURRENT APPLICATION DATA: 

29 (A) APPLICATION NUMBER: 08/083,590 

30 (B) FILING DATE: 25-JUN-1993 

31 (C) CLASSIFICATION: 
32 

33 (viii) ATTORNEY/AGENT INFORMATION: 

34 (A) NAME: Misrock, S. Leslie 

35 (B) REGISTRATION NUMBER: 18,872 

36 (C) REFERENCE/DOCKET NUMBER: 7326-015
37 

38 (ix) TELECOMMUNICATION INFORMATION: 

39 (A) TELEPHONE: 212 790-9090 

40 (B) TELEFAX: 212 869-8864/9741

41 (C) TELEX: 66141 PENNIE 
42 

43 

44 (2) INFORMATION FOR SEQ ID NO:1:
45 

46 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 2892 base pairs 

48 (B) TYPE: nucleic acid 

49 (C) STRANDEDNESS: double

50 (D) TOPOLOGY: unknown 
51 







52 (ii) MOLECULE TYPE: cDNA 

53 

54 

55 (ix) FEATURE: 

56 (A) NAME/KEY: CDS

57 (B) LOCATION: 142..2640
58 

59 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

61 

62 GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTAG TTGCCGCACA 60 
63 

64 CACACACACA CACAGCCCGT GGATTATTAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 
65 

66 GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 171 

67 Met His Trp Ile Lys Cys Leu Leu Thr Ala

68 1 5 10 
69 

70 TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 

71 Phe Ile Cys Phe Thr Val Ile Val Gln Val His Ser Ser Gly Ser Phe

72 15 20 25 
73 

74 GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267

75 Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 

76 30 35 40 
77 

78 GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 

79 Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 

80 45 50 55 
81 

82 GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 

83 Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gln Ala

84 60 65 70 
85 

86 ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 

87 Thr Ile Asp Thr Thr Ser Gln Cys Thr Tyr Gly Asp Val Ile Thr Pro

88 75 80 85 90 
89 

90 ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459

91 Ile Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gln Arg Phe Gln

92 95 100 105 
93 

94 AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 

95 Asn Lys Gly Phe Thr Asn Pro Ile Gln Phe Pro Phe Ser Phe Ser Trp

96 110 115 120 
97 

98 CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 555 

99 Pro Gly Thr Phe Ser Leu Ile Val Glu Ala Trp His Asp Thr Asn Asn
100 125 130 135 

101 

102 AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 603

103 Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu Ile Gln Arg Leu Leu Val

104 140 145 150 
105 

106 CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 

107 Gln Gln Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu

108 155 160 165 170 
109 

110 TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699

111 Ser Gln Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu

112 175 180 185 
113 

114 AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 

115 Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 

116 190 195 200 
117 

118 TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 

119 Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu Ile Ile Cys Leu

120 205 210 215 
121 

122 ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 

123 Thr Gly Trp Gln Gly Asp Tyr Cys His Ile Pro Lys Cys Ala Lys Gly

124 220 225 230 
125 

126 TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891

127 Cys Glu His Gly His Cys Asp Lys Pro Asn Gln Cys Val Cys Gln Leu

128 235 240 245 250 
129 

130 GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 

131 Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 

132 255 260 265 
133 

134 ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 987 

135 Ile His Gly Thr Cys Asn Lys Pro Trp Thr Cys Ile Cys Asn Glu Gly

136 270 275 280 
137 

138 TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 1035

139 Trp Gly Gly Leu Tyr Cys Asn Gln Asp Leu Asn Tyr Cys Thr Asn His

140 285 290 295 
141 

142 AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 1083 

143 Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 

144 300 305 310 
145 

146 TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 

147 Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 

148 315 320 325 330 
149 

150 GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1179 

151 Glu Ile Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gln Asn Gly Gly

152 335 340 345 
153 



154 ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 1227

155 Thr Cys Ile Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys

156 350 355 360 
157 

158 GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 1275 

159 Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Val Leu Thr Cys 

160 365 370 375 
161 

162 TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 

163 Ser Asp Lys Pro Cys His Gln Gly Ile Cys Arg Asn Val Arg Pro Gly

164 380 385 390 
165 

166 TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371

167 Leu Gly Ser Lys Gly Gln Gly Tyr Gln Cys Glu Cys Pro Ile Gly Tyr

168 395 400 405 410 
169 

170 AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 1419 

171 Ser Gly Pro Asn Cys Asp Leu Gln Leu Asp Asn Cys Ser Pro Asn Pro

172 415 420 425 
173 

174 TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 

175 Cys Ile Asn Gly Gly Ser Cys Gln Pro Ser Gly Lys Cys Ile Cys Pro

176 430 435 440 
177 

178 GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 

179 Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn Ile Asp Asp Cys Leu

180 445 450 455 
181 

182 GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 1563 

183 Gly His Gln Cys Glu Asn Gly Gly Thr Cys Ile Asp Met Val Asn Gln

184 460 465 470 
185 

186 TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 1611 

187 Tyr Arg Cys Gln Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser

188 475 480 485 490 
189 

190 AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 1659 

191 Lys Val Asp Leu Cys Leu Ile Arg Pro Cys Ala Asn Gly Gly Thr Cys

192 495 500 505 
193 

194 TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 1707 

195 Leu Asn Leu Asn Asn Asp Tyr Gln Cys Thr Cys Arg Ala Gly Phe Thr

196 510 515 520 
197 

198 GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 1755 

199 Gly Lys Asp Cys Ser Val Asp Ile Asp Glu Cys Ser Ser Gly Pro Cys

200 525 530 535 
201 

202 CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 1803 

203 His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 

204 540 545 550 



205 

206 TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851

207 Cys Ala Asn Gly Phe Arg Gly Lys Gln Cys Asp Glu Glu Ser Tyr Asp

208 555 560 565 570
209 

210 TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899

211 Ser Val Thr Phe Asp Ala His Gln Tyr Gly Ala Thr Thr Gln Ala Arg

212 575 580 585
213 

214 GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947

215 Ala Asp Gly Leu Thr Asn Ala Gln Val Val Leu Ile Ala Val Phe Ser

216 590 595 600
217 

218 GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995

219 Val Ala Met Pro Leu Val Ala Val Ile Ala Ala Cys Val Val Phe Cys

220 605 610 615
221 

222 ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043

223 Met Lys Arg Lys Arg Lys Arg Ala Gln Glu Lys Asp Asp Ala Glu Ala

224 620 625 630
225 

226 AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091

227 Arg Lys Gln Asn Glu Gln Asn Ala Val Ala Thr Met His His Asn Gly

228 635 640 645 650
229 

230 AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139

231 Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr

232 655 660 665
233 

234 GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187

235 Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn Ile Ile

236 670 675 680
237 

238 AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235

239 Lys Asn Thr Trp Asp Lys Ser Val Asn Asn Ile Cys Ala Ser Ala Ala

240 685 690 695
241 

242 GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283

243 Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly

244 700 705 710
245 

246 GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331

247 Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe

248 715 720 725 730
249 

250 TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379

251 Cys Val Ala Pro Leu Gln Arg Ala Lys Ser Gln Lys Gln Leu Asn Thr

252 735 740 745
253 

254 GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427

255 Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys

256 750 755 760

257 

258 GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 

259 Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg Ile Ser

260 765 770 775 
261 

262 GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 

263 Val Leu Gly Glu Gly Ser Tyr Cys Ser Gln Arg Trp Pro Ser Leu Ala

264 780 785 790 
265 

266 GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 

267 Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gln Leu Met Ala Ala Ala

268 795 800 805 810 
269 

270 TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 

271 Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gln Gln Gln Arg Ser Val

272 815 820 825 
273 

274 GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 

275 Val Cys Gly Thr Pro His Met 

276 830 
277 

278 AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730
279 

280 GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 
281 

282 ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 
283 

284 TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 

285 

286 

287 (2) INFORMATION FOR SEQ ID NO:2:
288 

289 (i) SEQUENCE CHARACTERISTICS: 

290 (A) LENGTH: 833 amino acids 

291 (B) TYPE: amino acid 

292 (D) TOPOLOGY: unknown 
293 

294 (ii) MOLECULE TYPE: protein 

295 

296 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

297 

298 Met His Trp Ile Lys Cys Leu Leu Thr Ala Phe Ile Cys Phe Thr Val

299 1 5 10 15
300 

301 Ile Val Gln Val His Ser Ser Gly Ser Phe Glu Leu Arg Leu Lys Tyr

302 20 25 30 
303 

304 Phe Ser Asn Asp His Gly Arg Asp Asn Glu Gly Arg Cys Cys Ser Gly

305 35 40 45 

306 






307 Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu Gly Ser Cys Lys Thr Arg 

308 50 55 60 
309 

310 Phe Arg Val Cys Leu Lys His Tyr Gln Ala Thr Ile Asp Thr Thr Ser

311 65 70 75 80 
312 

313 Gln Cys Thr Tyr Gly Asp Val Ile Thr Pro Ile Leu Gly Glu Asn Ser

314 85 90 95 
315 

316 Val Asn Leu Thr Asp Ala Gln Arg Phe Gln Asn Lys Gly Phe Thr Asn

317 100 105 110 
318 

319 Pro Ile Gln Phe Pro Phe Ser Phe Ser Trp Pro Gly Thr Phe Ser Leu

320 115 120 125 
321 

322 Ile Val Glu Ala Trp His Asp Thr Asn Asn Ser Gly Asn Ala Arg Thr

323 130 135 140 
324 

325 Asn Lys Leu Leu Ile Gln Arg Leu Leu Val Gln Gln Val Leu Glu Val

326 145 150 155 160 
327 

328 Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu Ser Gln Tyr Thr Ser Leu

329 165 170 175 
330 

331 Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu Asn Tyr Tyr Gly Ser Gly 

332 180 185 190 
333 

334 Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp Ser Phe Gly His Ser Thr 

335 195 200 205 
336 

337 Cys Ser Glu Thr Gly Glu Ile Ile Cys Leu Thr Gly Trp Gln Gly Asp

338 210 215 220 
339 

340 Tyr Cys His Ile Pro Lys Cys Ala Lys Gly Cys Glu His Gly His Cys

341 225 230 235 240 
342 

343 Asp Lys Pro Asn Gln Cys Val Cys Gln Leu Gly Trp Lys Gly Ala Leu

344 245 250 255 
345 

346 Cys Asn Glu Cys Val Leu Glu Pro Asn Cys Ile His Gly Thr Cys Asn

347 260 265 270 
348 

349 Lys Pro Trp Thr Cys Ile Cys Asn Glu Gly Trp Gly Gly Leu Tyr Cys

350 275 280 285 
351 

352 Asn Gln Asp Leu Asn Tyr Cys Thr Asn His Arg Pro Cys Lys Asn Gly

353 290 295 300 
354 

355 Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu Tyr Thr Cys Lys Cys Ala 

356 305 310 315 320 
357 



358 Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn Glu Ile Tyr Ser Cys Asp

359 325 330 335
360 

361 Ala Asp Val Asn Pro Cys Gln Asn Gly Gly Thr Cys Ile Asp Glu Pro

362 340 345 350
363 

364 His Thr Lys Thr Gly Tyr Lys Cys His Cys Ala Asn Gly Trp Ser Gly

365 355 360 365
366 

367 Lys Met Cys Glu Glu Lys Val Leu Thr Cys Ser Asp Lys Pro Cys His

368 370 375 380
369 

370 Gln Gly Ile Cys Arg Asn Val Arg Pro Gly Leu Gly Ser Lys Gly Gln

371 385 390 395 400
372 

373 Gly Tyr Gln Cys Glu Cys Pro Ile Gly Tyr Ser Gly Pro Asn Cys Asp

374 405 410 415
375 

376 Leu Gln Leu Asp Asn Cys Ser Pro Asn Pro Cys Ile Asn Gly Gly Ser

377 420 425 430
378 

379 Cys Gln Pro Ser Gly Lys Cys Ile Cys Pro Ala Gly Phe Ser Gly Thr

380 435 440 445
381 

382 Arg Cys Glu Thr Asn Ile Asp Asp Cys Leu Gly His Gln Cys Glu Asn

383 450 455 460
384 

385 Gly Gly Thr Cys Ile Asp Met Val Asn Gln Tyr Arg Cys Gln Cys Val

386 465 470 475 480
387 

388 Pro Gly Phe His Gly Thr His Cys Ser Ser Lys Val Asp Leu Cys Leu

389 485 490 495
390 

391 Ile Arg Pro Cys Ala Asn Gly Gly Thr Cys Leu Asn Leu Asn Asn Asp

392 500 505 510
393 

394 Tyr Gln Cys Thr Cys Arg Ala Gly Phe Thr Gly Lys Asp Cys Ser Val

395 515 520 525
396 

397 Asp Ile Asp Glu Cys Ser Ser Gly Pro Cys His Asn Gly Gly Thr Cys

398 530 535 540
399 

400 Met Asn Arg Val Asn Ser Phe Glu Cys Val Cys Ala Asn Gly Phe Arg

401 545 550 555 560
402 

403 Gly Lys Gln Cys Asp Glu Glu Ser Tyr Asp Ser Val Thr Phe Asp Ala

404 565 570 575
405 

406 His Gln Tyr Gly Ala Thr Thr Gln Ala Arg Ala Asp Gly Leu Thr Asn

407 580 585 590
408 






409 Ala Gln Val Val Leu Ile Ala Val Phe Ser Val Ala Met Pro Leu Val

410 595 600 605 
411 

412 Ala Val Ile Ala Ala Cys Val Val Phe Cys Met Lys Arg Lys Arg Lys

413 610 615 620 
414 

415 Arg Ala Gln Glu Lys Asp Asp Ala Glu Ala Arg Lys Gln Asn Glu Gln

416 625 630 635 640 
417 

418 Asn Ala Val Ala Thr Met His His Asn Gly Ser Gly Val Gly Val Ala 

419 645 650 655 
420 

421 Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr Gly Ser Asn Ser Gly Leu 

422 660 665 670 
423 

424 Thr Phe Asp Gly Gly Asn Pro Asn Ile Ile Lys Asn Thr Trp Asp Lys

425 675 680 685 
426 

427 Ser Val Asn Asn Ile Cys Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala

428 690 695 700 
429 

430 Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly Gly Tyr Val Ala Ser Val 

431 705 710 715 720 
432 

433 Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe Cys Val Ala Pro Leu Gln

434 725 730 735 
435 

436 Arg Ala Lys Ser Gln Lys Gln Leu Asn Thr Asp Pro Thr Leu Met His

437 740 745 750 
438 

439 Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys Gly Ala Ser Gly Gly Gly

440 755 760 765 

441 

442 Pro Gly Ala Ala Glu Gly Lys Arg Ile Ser Val Leu Gly Glu Gly Ser

443 770 775 780 
444 

445 Tyr Cys Ser Gln Arg Trp Pro Ser Leu Ala Ala Ala Gly Val Ala Gly

446 785 790 795 800 
447 

448 Ala Cys Ser Ser Gln Leu Met Ala Ala Ala Ser Ala Ala Gly Ser Gly

449 805 810 815 
450 

451 Ala Gly Thr Ala Gln Gln Gln Arg Ser Val Val Cys Gly Thr Pro His

452 820 825 830 
453 

454 Met 

455 

456 

457 (2) INFORMATION FOR SEQ ID NO:3: 
458 

459 (i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 442..1320



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CCGAGTCGAG CGCCGTGCTT CGAGCGGTGA TGAGCCCCTT TTCTGTCAAC GCTAAAGATC 60 

TACAAAACAT CAGCGCCTAT CAAGTGGAAG TGTCAAGTGT GAACAAAACA AAAACGAGAG 120 

AAGCACATAC TAAGGTCCAT ATAAATAATA AATAATAATT GTGTGTGATA ACAACATTAT 180 

CCAAACAAAA CCAAACAAAA CGAAGGCAAA GTGGAGAAAA TGATACAGCA TCCAGAGTAC 240 

GGCCGTTATT CAGCTATCCA GAGCAAGTGT AGTGTGGCAA AATAGAAACA AACAAAGGCA 300 

CCAAAATCTG CATACATGGG CTAATTAAGG CTGCCCAGCG AATTTACATT TGTGTGGTGC 360 

CAATCCAGAG TGAATCCGAA ACAAACTCCA TCTAGATCGC CAACCAGCAT CACGCTCGCA 420 

AACGCCCCCA GAATGTACAA A ATG TTT AGG AAA CAT TTT CGG CGA AAA CCA 471 

Met Phe Arg Lys His Phe Arg Arg Lys Pro 
1 5 10

GCT ACG TCG TCG TCG TTG GAG TCA ACA ATA GAA TCA GCA GAC AGC CTG 519 
Ala Thr Ser Ser Ser Leu Glu Ser Thr Ile Glu Ser Ala Asp Ser Leu
15 20 25 

GGA ATG TCC AAG AAG ACG GCG ACA AAA AGG CAG CGT CCG AGG CAT CGG 567 
Gly Met Ser Lys Lys Thr Ala Thr Lys Arg Gln Arg Pro Arg His Arg
30 35 40 

GTA CCC AAA ATC GCG ACC CTG CCA TCG ACG ATC CGC GAT TGT CGA TCA 615 
Val Pro Lys Ile Ala Thr Leu Pro Ser Thr Ile Arg Asp Cys Arg Ser
45 50 55 

TTA AAG TCT GCC TGC AAC TTA ATT GCT TTA ATT TTA ATA CTG TTA GTC 663 
Leu Lys Ser Ala Cys Asn Leu Ile Ala Leu Ile Leu Ile Leu Leu Val
60 65 70 

CAT AAG ATA TCC GCA GCT GGT AAC TTC GAG CTG GAA ATA TTA GAA ATC 711

511 His Lys Ile Ser Ala Ala Gly Asn Phe Glu Leu Glu Ile Leu Glu Ile

512 75 80 85 90 
513 

514 TCA AAT ACC AAC AGC CAT CTA CTC AAC GGC TAT TGC TGC GGC ATG CCA 759 

515 Ser Asn Thr Asn Ser His Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro 

516 95 100 105 
517 

518 GCG GAA CTT AGG GCC ACC AAG ACG ATA GGC TGC TCG CCA TGC ACG ACG 807 

519 Ala Glu Leu Arg Ala Thr Lys Thr Ile Gly Cys Ser Pro Cys Thr Thr

520 110 115 120 
521 

522 GCA TTC CGG CTG TGC CTG AAG GAG TAC CAG ACC ACG GAG CAG GGT GCC 855

523 Ala Phe Arg Leu Cys Leu Lys Glu Tyr Gln Thr Thr Glu Gln Gly Ala

524 125 130 135 
525 

526 AGC ATA TCC ACG GGC TGT TCG TTT GGC AAC GCC ACC ACC AAG ATA CTG 903

527 Ser Ile Ser Thr Gly Cys Ser Phe Gly Asn Ala Thr Thr Lys Ile Leu

528 140 145 150 
529 

530 GGT GGC TCC AGC TTT GTG CTC AGC GAT CCG GGT GTG GGA GCC ATT GTG 951 

531 Gly Gly Ser Ser Phe Val Leu Ser Asp Pro Gly Val Gly Ala Ile Val

532 155 160 165 170 
533 

534 CTG CCC TTT ACG TTT CGT TGG ACG AAG TCG TTT ACG CTG ATA CTG CAG 999 

535 Leu Pro Phe Thr Phe Arg Trp Thr Lys Ser Phe Thr Leu Ile Leu Gln

536 175 180 185 
537 

538 GCG TTG GAT ATG TAC AAC ACA TCC TAT CCA GAT GCG GAG AGG TTA ATT 1047 

539 Ala Leu Asp Met Tyr Asn Thr Ser Tyr Pro Asp Ala Glu Arg Leu Ile

540 190 195 200 
541 

542 GAG GAA ACA TCA TAC TCG GGC GTG ATA CTG CCG TCG CCG GAG TGG AAG 1095 

543 Glu Glu Thr Ser Tyr Ser Gly Val Ile Leu Pro Ser Pro Glu Trp Lys

544 205 210 215 
545 

546 ACG CTG GAC CAC ATC GGG CGG AAC GCG CGG ATC ACC TAC CGT GTC CGG 1143 

547 Thr Leu Asp His Ile Gly Arg Asn Ala Arg Ile Thr Tyr Arg Val Arg

548 220 225 230 
549 

550 GTG CAA TGC GCC GTT ACC TAC TAC AAC ACG ACC TGC ACG ACC TTC TGC 1191 

551 Val Gln Cys Ala Val Thr Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys

552 235 240 245 250 
553 

554 CGT CCG CGG GAC GAT CAG TTC GGT CAC TAC GCC TGC GGC TCC GAG GGT 1239 

555 Arg Pro Arg Asp Asp Gln Phe Gly His Tyr Ala Cys Gly Ser Glu Gly

556 255 260 265 
557 

558 CAG AAG CTC TGC CTG AAT GGC TGG CAG GGC GTC AAC TGC GAG GAG GCC 1287

559 Gln Lys Leu Cys Leu Asn Gly Trp Gln Gly Val Asn Cys Glu Glu Ala

560 270 275 280
561 

562 ATA TGC AAG GCG GGC TGC GAC CCC GTC CAC GGC 1320

563 Ile Cys Lys Ala Gly Cys Asp Pro Val His Gly

564 285 290 
565 

566 

567 (2) INFORMATION FOR SEQ ID NO:4:
568 

569 (i) SEQUENCE CHARACTERISTICS: 

570 (A) LENGTH: 293 amino acids 

571 (B) TYPE: amino acid 

572 (D) TOPOLOGY: unknown 
573 

574 (ii) MOLECULE TYPE: protein 

575 

576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

577 

578 Met Phe Arg Lys His Phe Arg Arg Lys Pro Ala Thr Ser Ser Ser Leu 

579 1 5 10 15
580 

581 Glu Ser Thr Ile Glu Ser Ala Asp Ser Leu Gly Met Ser Lys Lys Thr

582 20 25 30 
583 

584 Ala Thr Lys Arg Gln Arg Pro Arg His Arg Val Pro Lys Ile Ala Thr

585 35 40 45 
586 

587 Leu Pro Ser Thr Ile Arg Asp Cys Arg Ser Leu Lys Ser Ala Cys Asn

588 50 55 60 
589 

590 Leu Ile Ala Leu Ile Leu Ile Leu Leu Val His Lys Ile Ser Ala Ala

591 65 70 75 80 
592 

593 Gly Asn Phe Glu Leu Glu Ile Leu Glu Ile Ser Asn Thr Asn Ser His

594 85 90 95 
595 

596 Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro Ala Glu Leu Arg Ala Thr 

597 100 105 110 
598 

599 Lys Thr Ile Gly Cys Ser Pro Cys Thr Thr Ala Phe Arg Leu Cys Leu

600 115 120 125 
601 

602 Lys Glu Tyr Gln Thr Thr Glu Gln Gly Ala Ser Ile Ser Thr Gly Cys

603 130 135 140 
604 

605 Ser Phe Gly Asn Ala Thr Thr Lys Ile Leu Gly Gly Ser Ser Phe Val

606 145 150 155 160 
607 

608 Leu Ser Asp Pro Gly Val Gly Ala Ile Val Leu Pro Phe Thr Phe Arg

609 165 170 175 
610 

611 Trp Thr Lys Ser Phe Thr Leu Ile Leu Gln Ala Leu Asp Met Tyr Asn

612 180 185 190
613 

614 Thr Ser Tyr Pro Asp Ala Glu Arg Leu Ile Glu Glu Thr Ser Tyr Ser

615 195 200 205 
616 

617 Gly Val Ile Leu Pro Ser Pro Glu Trp Lys Thr Leu Asp His Ile Gly

618 210 215 220 
619 

620 Arg Asn Ala Arg Ile Thr Tyr Arg Val Arg Val Gln Cys Ala Val Thr
621 225 230 235 240 

622 

623 Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys Arg Pro Arg Asp Asp Gln

624 245 250 255 
625 

626 Phe Gly His Tyr Ala Cys Gly Ser Glu Gly Gln Lys Leu Cys Leu Asn

627 260 265 270 
628 

629 Gly Trp Gln Gly Val Asn Cys Glu Glu Ala Ile Cys Lys Ala Gly Cys
630 275 280 285 

631 

632 Asp Pro Val His Gly 

633 290 
634 

635 

636 (2) INFORMATION FOR SEQ ID NO:5:
637 

638 (i) SEQUENCE CHARACTERISTICS:

639 (A) LENGTH: 267 base pairs

640 (B) TYPE: nucleic acid 

641 (C) STRANDEDNESS: double

642 (D) TOPOLOGY: unknown 
643 

644 (ii) MOLECULE TYPE: cDNA 

645 

646 

647 

648 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

649 

650 CGGTGGACTT CCTTCGTGTA TTGGTGGGAG CCCTCGGGAA CGGGGGGTAA CACTGAAAGG 60

651 

652 TCGAGTACCC ATTTCCGTCA TAACGGGTTG GTCGCCCCCT AGGGGTCGGA GTCAGGTGGA 120

653 

654 CGGGAGGTCG ACAACGCCCG GGGGACGGGT GGTACATGGT GTAAGGTCTT TACCGGACCG 180

655 

656 GGCAAACGGG TCACACCGAA AGGGGTGAAC GGTAACTACG GGGTCGTCCT GCCCGTCCAT 240

657 

658 CGAGTCTGGT AAGAGGGTCG CCTTAAG 267

659 

660 (2) INFORMATION FOR SEQ ID NO:6:
661 

662 (i) SEQUENCE CHARACTERISTICS: 

663 (A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 60 

CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 120 

GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 180 

AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 240 

TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 300

GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 360

TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGNC CAGACAGACC GGACTGGTGA 420

GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 480 

TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 540 

GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 574 
(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 60 

CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 120 

GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 180

CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 240

AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 295



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 60 

ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 120 

TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 180 

GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 240 



CAGAGCTG 248 
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

TACGTATCTC GAGCACAGAC AGCTGACGTA CACTTTTNNA GTGCGAGGGA CATTCGTCCG 60 

ACCAGTACGA ACATTTAGGC TCAGTACGGT AGGTCCATGG CCAAGACTAG GAGACGTAGG 120 

GAGCTACAGG TCCCGCTCGC TAAACTCGGA CCACTGAAAC CTCCGGTCGA CAGTCGGTAA 180 

GCGAACAAGA GGGCCAGATC TTAGAGAAGG TGTCGCGGCG AGACTCGGGC TCGGGTCAGG 240
766 

767 CGGCCTTAAG GACGTCGGGC CCNNNAGGTG ATCAAGATCT CGNCNCGGCG GGCGCCACCT 300

768 

769 CGAGGNCGAA AACAAGGGAA ATC 323 

770 

771 



772 (2) INFORMATION FOR SEQ ID NO:10:
773 

774 (i) SEQUENCE CHARACTERISTICS: 

775 (A) LENGTH: 3234 base pairs 

776 (B) TYPE: nucleic acid 

777 (C) STRANDEDNESS: double

778 (D) TOPOLOGY: unknown 
779 

780 (ii) MOLECULE TYPE: cDNA 

781 

782 

783 (ix) FEATURE: 

784 (A) NAME/KEY: CDS 

785 (B) LOCATION: 1..3234 
786 

787 

788 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

789 

790 TGC CAG GAG GAC GCG GGC AAC AAG GTC TGC AGC CTG CAG TGC AAC AAC 48 

791 Cys Gln Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gln Cys Asn Asn

792 1 5 10 15
793 

794 CAC GCG TGC GGC TGG GAC GGC GGT GAC TGC TCC CTC AAC TTC AAT GAC 96 

795 His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 

796 20 25 30 
797 

798 CCC TGG AAG AAC TGC ACG CAG TCT CTG CAG TGC TGG AAG TAC TTC AGT 144 

799 Pro Trp Lys Asn Cys Thr Gln Ser Leu Gln Cys Trp Lys Tyr Phe Ser

800 35 40 45 
801 

802 GAC GGC CAC TGT GAC AGC CAG TGC AAC TCA GCC GGC TGC CTC TTC GAC 192 

803 Asp Gly His Cys Asp Ser Gln Cys Asn Ser Ala Gly Cys Leu Phe Asp

804 50 55 60 
805 

806 GGC TTT GAC TGC CAG CGT GCG GAA GGC CAG TGC AAC CCC CTG TAC GAC 240 

807 Gly Phe Asp Cys Gln Arg Ala Glu Gly Gln Cys Asn Pro Leu Tyr Asp

808 65 70 75 80 
809 

810 CAG TAC TGC AAG GAC CAC TTC AGC GAC GGG CAC TGC GAC CAG GGC TGC 288 

811 Gln Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gln Gly Cys

812 85 90 95 
813 

814 AAC AGC GCG GAG TGC GAG TGG GAC GGG CTG GAC TGT GCG GAG CAT GTA 336 

815 Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 

816 100 105 110
817 

818 CCC GAG AGG CTG GCG GCC GGC ACG CTG GTG GTG GTG GTG CTG ATG CCG 384 

819 Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 

820 115 120 125 
821 

822 CCG GAG CAG CTG CGC AAC AGC TCC TTC CAC TTC CTG CGG GAG CTC AGC 432 

823 Pro Glu Gln Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser

824 130 135 140 
825 

826 CGC GTG CTG CAC ACC AAC GTG GTC TTC AAG CGT GAC GCA CAC GGC CAG 480 

827 Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gln

828 145 150 155 160 
829 

830 CAG ATG ATC TTC CCC TAC TAC GGC CGC GAG GAG GAG CTG CGC AAG CAC 528 

831 Gln Met Ile Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His

832 165 170 175 
833 

834 CCC ATC AAG CGT GCC GCC GAG GGC TGG GCC GCA CCT GAC GCC CTG CTG 576 

835 Pro Ile Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu

836 180 185 190 
837 

838 GGC CAG GTG AAG GCC TCG CTG CTC CCT GGT GGC AGC GAG GGT GGG CGG 624 

839 Gly Gln Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg

840 195 200 205 
841 

842 CGG CGG AGG GAG CTG GAC CCC ATG GAC GTC CGC GGC TCC ATC GTC TAC 672 

843 Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser Ile Val Tyr

844 210 215 220 
845 

846 CTG GAG ATT GAC AAC CGG CAG TGT GTG CAG GCC TCC TCG CAG TGC TTC 720 

847 Leu Glu Ile Asp Asn Arg Gln Cys Val Gln Ala Ser Ser Gln Cys Phe

848 225 230 235 240 
849 

850 CAG AGT GCC ACC GAC GTG GCC GCA TTC CTG GGA GCG CTC GCC TCG CTG 768 

851 Gln Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu

852 245 250 255 
853 

854 GGC AGC CTC AAC ATC CCC TAC AAG ATC GAG GCC GTG CAG AGT GAG ACC 816 

855 Gly Ser Leu Asn Ile Pro Tyr Lys Ile Glu Ala Val Gln Ser Glu Thr

856 260 265 270 
857 

858 GTG GAG CCG CCC CCG CCG GCG CAG CTG CAC TTC ATG TAC GTG GCG GCG 864 

859 Val Glu Pro Pro Pro Pro Ala Gln Leu His Phe Met Tyr Val Ala Ala

860 275 280 285 
861 

862 GCC GCC TTT GTG CTT CTG TTC TTC GTG GGC TGC GGG GTG CTG CTG TCC 912 

863 Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 

864 290 295 300 
865 

866 CGC AAG CGC CGG CGG CAG CAT GGC CAG CTC TGG TTC CCT GAG GGC TTC 960 

867 Arg Lys Arg Arg Arg Gln His Gly Gln Leu Trp Phe Pro Glu Gly Phe

868 305 310 315 320

869 

870 AAA GTG TCT GAG GCC AGC AAG AAG AAG CGG CGG GAG CCC CTC GGC GAG 1008 

871 Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 

872 325 330 335 
873 

874 GAC TCC GTG GGC CTC AAG CCC CTG AAG AAC GCT TCA GAC GGT GCC CTC 1056 

875 Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 

876 340 345 350 
877 

878 ATG GAC GAC AAC CAG AAT GAG TGG GGG GAC GAG GAC CTG GAG ACC AAG 1104 

879 Met Asp Asp Asn Gln Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys

880 355 360 365 
881 

882 AAG TTC CGG TTC GAG GAG CCC GTG GTT CTG CCT GAC CTG GAC GAC CAG 1152 

883 Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gln

884 370 375 380 
885 

886 ACA GAC CAC CGG CAG TGG ACT CAG CAG CAC CTG GAT GCC GCT GAC CTG 1200 

887 Thr Asp His Arg Gln Trp Thr Gln Gln His Leu Asp Ala Ala Asp Leu

888 385 390 395 400 
889 

890 CGC ATG TCT GCC ATG GCC CCC ACA CCG CCC CAG GGT GAG GTT GAC GCC 1248 

891 Arg Met Ser Ala Met Ala Pro Thr Pro Pro Gln Gly Glu Val Asp Ala

892 405 410 415 
893 

894 GAC TGC ATG GAC GTC AAT GTC CGC GGG CCT GAT GGC TTC ACC CCG CTC 1296

895 Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 

896 420 425 430 
897 

898 ATG ATC GCC TCC TGC AGC GGG GGC GGC CTG GAG ACG GGC AAC AGC GAG 1344 

899 Met Ile Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu

900 435 440 445 
901 

902 GAA GAG GAG GAC GCG CCG GCC GTC ATC TCC GAC TTC ATC TAC CAG GGC 1392 

903 Glu Glu Glu Asp Ala Pro Ala Val Ile Ser Asp Phe Ile Tyr Gln Gly

904 450 455 460 
905 

906 GCC AGC CTG CAC AAC CAG ACA GAC CGC ACG GGC GAG ACC GCC TTG CAC 1440 

907 Ala Ser Leu His Asn Gln Thr Asp Arg Thr Gly Glu Thr Ala Leu His

908 465 470 475 480 
909 

910 CTG GCC GCC CGC TAC TCA CGC TCT GAT GCC GCC AAG CGC CTG CTG GAG 1488 

911 Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 

912 485 490 495 
913 

914 GCC AGC GCA GAT GCC AAC ATC CAG GAC AAC ATG GGC CGC ACC CCG CTG 1536 

915 Ala Ser Ala Asp Ala Asn Ile Gln Asp Asn Met Gly Arg Thr Pro Leu

916 500 505 510 
917 

918 CAT GCG GCT GTG TCT GCC GAC GCA CAA GGT GTC TTC CAG ATC CTG ATC 1584 
919 His Ala Ala Val Ser Ala Asp Ala Gln Gly Val Phe Gln Ile Leu Ile

920 515 520 525 
921 

922 CGG AAC CGA GCC ACA GAC CTG GAT GCC CGC ATG CAT GAT GGC ACG ACG 1632 

923 Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 

924 530 535 540 
925 

926 CCA CTG ATC CTG GCT GCC CGC CTG GCC GTG GAG GGC ATG CTG GAG GAC 1680 

927 Pro Leu Ile Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp

928 545 550 555 560 
929 

930 CTC ATC AAC TCA CAC GCC GAC GTC AAC GCC GTA GAT GAC CTG GGC AAG 1728

931 Leu Ile Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys

932 565 570 575 
933 

934 TCC GCC CTG CAC TGG GCC GCC GCC GTG AAC AAT GTG GAT GCC GCA GTT 1776 

935 Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 

936 580 585 590 
937 

938 GTG CTC CTG AAG AAC GGG GCT AAC AAA GAT ATG CAG AAC AAC AGG GAG 1824 
939 Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gln Asn Asn Arg Glu
940 595 600 605 

941 

942 GAG ACA CCC CTG TTT CTG GCC GCC CGG GAG GGC AGC TAC GAG ACC GCC 1872 

943 Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 

944 610 615 620 
945 

946 AAG GTG CTG CTG GAC CAC TTT GCC AAC CGG GAC ATC ACG GAT CAT ATG 1920

947 Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met

948 625 630 635 640 
949 

950 GAC CGC CTG CCG CGC GAC ATC GCA CAG GAG CGC ATG CAT CAC GAC ATC 1968 

951 Asp Arg Leu Pro Arg Asp Ile Ala Gln Glu Arg Met His His Asp Ile

952 645 650 655 
953 

954 GTG AGG CTG CTG GAC GAG TAC AAC CTG GTG CGC AGC CCG CAG CTG CAC 2016 

955 Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gln Leu His

956 660 665 670 
957 

958 GGA GCC CCG CTG GGG GGC ACG CCC ACC CTG TCG CCC CCG CTC TGC TCG 2064 

959 Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 

960 675 680 685 
961 

962 CCC AAC GGC TAC CTG GGC AGC CTC AAG CCC GGC GTG CAG GGC AAG AAG 2112 

963 Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gln Gly Lys Lys

964 690 695 700 
965 

966 GTC CGC AAG CCC AGC AGC AAA GGC CTG GCC TGT GGA AGC AAG GAG GCC 2160 

967 Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 

968 705 710 715 720 
969 
970 AAG GAC CTC AAG GCA CGG AGG AAG AAG TCC CAG GAT GGC AAG GGC TGC 2208

971 Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gln Asp Gly Lys Gly Cys

972 725 730 735 
973 

974 CTG CTG GAC AGC TCC GGC ATG CTC TCG CCC GTG GAC TCC CTG GAG TCA 2256

975 Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 

976 740 745 750 
977 

978 CCC CAT GGC TAC CTG TCA GAC GTG GCC TCG CCG CCA CTG CTG CCC TCC 2304 

979 Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 

980 755 760 765 
981 

982 CCG TTC CAG CAG TCT CCG TCC GTG CCC CTC AAC CAC CTG CCT GGG ATG 2352

983 Pro Phe Gln Gln Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met

984 770 775 780 
985 

986 CCC GAC ACC CAC CTG GGC ATC GGG CAC CTG AAC GTG GCG GCC AAG CCC 2400

987 Pro Asp Thr His Leu Gly Ile Gly His Leu Asn Val Ala Ala Lys Pro

988 785 790 795 800 
989 

990 GAG ATG GCG GCG CTG GGT GGG GGC GGC CGG CTG GCC TTT GAG ACT GGC 2448 

991 Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 

992 805 810 815 
993 

994 CCA CCT CGT CTC TCC CAC CTG CCT GTG GCC TCT GGC ACC AGC ACC GTC 2496

995 Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 

996 820 825 830 
997 

998 CTG GGC TCC AGC AGC GGA GGG GCC CTG AAT TTC ACT GTG GGC GGG TCC 2544 

999 Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
1000 835 840 845 

1001 

1002 ACC AGT TTG AAT GGT CAA TGC GAG TGG CTG TCC CGG CTG CAG AGC GGC 2592

1003 Thr Ser Leu Asn Gly Gln Cys Glu Trp Leu Ser Arg Leu Gln Ser Gly

1004 850 855 860 
1005 

1006 ATG GTG CCG AAC CAA TAC AAC CCT CTG CGG GGG AGT GTG GCA CCA GGC 2640 

1007 Met Val Pro Asn Gln Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly

1008 865 870 875 880 
1009 

1010 CCC CTG AGC ACA CAG GCC CCC TCC CTG CAG CAT GGC ATG GTA GGC CCG 2688 

1011 Pro Leu Ser Thr Gln Ala Pro Ser Leu Gln His Gly Met Val Gly Pro

1012 885 890 895 
1013 

1014 CTG CAC AGT AGC CTT GCT GCC AGC GCC CTG TCC CAG ATG ATG AGC TAC 2736 

1015 Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gln Met Met Ser Tyr

1016 900 905 910 
1017 

1018 CAG GGC CTG CCC AGC ACC CGG CTG GCC ACC CAG CCT CAC CTG GTG CAG 2784

1019 Gln Gly Leu Pro Ser Thr Arg Leu Ala Thr Gln Pro His Leu Val Gln

1020 915 920 925 
1021

1022 ACC CAG CAG GTG CAG CCA CAA AAC TTA CAG ATG CAG CAG CAG AAC CTG 2832 

1023 Thr Gln Gln Val Gln Pro Gln Asn Leu Gln Met Gln Gln Gln Asn Leu

1024 930 935 940 
1025 

1026 CAG CCA GCA AAC ATC CAG CAG CAG CAA AGC CTG CAG CCG CCA CCA CCA 2880 

1027 Gln Pro Ala Asn Ile Gln Gln Gln Gln Ser Leu Gln Pro Pro Pro Pro

1028 945 950 955 960 
1029 

1030 CCA CCA CAG CCG CAC CTT GGC GTG AGC TCA GCA GCC AGC GGC CAC CTG 2928 

1031 Pro Pro Gln Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu

1032 965 970 975 
1033 

1034 GGC CGG AGC TTC CTG AGT GGA GAG CCG AGC CAG GCA GAC GTG CAG CCA 2976

1035 Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gln Ala Asp Val Gln Pro

1036 980 985 990 
1037 

1038 CTG GGC CCC AGC AGC CTG GCG GTG CAC ACT ATT CTG CCC CAG GAG AGC 3024

1039 Leu Gly Pro Ser Ser Leu Ala Val His Thr Ile Leu Pro Gln Glu Ser
1040 995 1000 1005 

1041 

1042 CCC GCC CTG CCC ACG TCG CTG CCA TCC TCG CTG GTC CCA CCC GTG ACC 3072

1043 Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 

1044 1010 1015 1020 
1045 

1046 GCA GCC CAG TTC CTG ACG CCC CCC TCG CAG CAC AGC TAC TCC TCG CCT 3120 

1047 Ala Ala Gln Phe Leu Thr Pro Pro Ser Gln His Ser Tyr Ser Ser Pro

1048 1025 1030 1035 1040 
1049 

1050 GTG GAC AAC ACC CCC AGC CAC CAG CTA CAG GTG CCT GTT CCT GTA ATG 3168 

1051 Val Asp Asn Thr Pro Ser His Gln Leu Gln Val Pro Val Pro Val Met

1052 1045 1050 1055 
1053 

1054 GTA ATG ATC CGA TCT TCG GAT CCT TCT AAA GGC TCA TCA ATT TTG ATC 3216 

1055 Val Met Ile Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser Ile Leu Ile

1056 1060 1065 1070 
1057 

1058 GAA GCT CCC GAC TCA TGG 3234 

1059 Glu Ala Pro Asp Ser Trp 

1060 1075 
1061 

1062 

1063 (2) INFORMATION FOR SEQ ID NO: 11: 
1064 

1065 (i) SEQUENCE CHARACTERISTICS: 

1066 (A) LENGTH: 1078 amino acids 

1067 (B) TYPE: amino acid 

1068 (D) TOPOLOGY: unknown 
1069 

1070 (ii) MOLECULE TYPE: protein 

1071 
1072 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

1073 

1074 Cys Gln Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gln Cys Asn Asn

1075 1 5 10 15
1076 

1077 His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 

1078 20 25 30 
1079 

1080 Pro Trp Lys Asn Cys Thr Gln Ser Leu Gln Cys Trp Lys Tyr Phe Ser

1081 35 40 45 
1082 

1083 Asp Gly His Cys Asp Ser Gln Cys Asn Ser Ala Gly Cys Leu Phe Asp

1084 50 55 60 
1085 

1086 Gly Phe Asp Cys Gln Arg Ala Glu Gly Gln Cys Asn Pro Leu Tyr Asp

1087 65 70 75 80 
1088 

1089 Gln Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gln Gly Cys

1090 85 90 95 
1091 

1092 Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 

1093 100 105 110 
1094 

1095 Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 

1096 115 120 125 
1097 

1098 Pro Glu Gln Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser

1099 130 135 140 
1100 

1101 Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gln

1102 145 150 155 160 
1103 

1104 Gln Met Ile Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His

1105 165 170 175 
1106 

1107 Pro Ile Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu

1108 180 185 190 
1109 

1110 Gly Gln Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg

1111 195 200 205 
1112 

1113 Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser Ile Val Tyr

1114 210 215 220 
1115 

1116 Leu Glu Ile Asp Asn Arg Gln Cys Val Gln Ala Ser Ser Gln Cys Phe

1117 225 230 235 240 
1118 

1119 Gln Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu

1120 245 250 255 
1121 

1122 Gly Ser Leu Asn Ile Pro Tyr Lys Ile Glu Ala Val Gln Ser Glu Thr

1123 260 265 270

1124 

1125 Val Glu Pro Pro Pro Pro Ala Gln Leu His Phe Met Tyr Val Ala Ala

1126 275 280 285 
1127 

1128 Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 

1129 290 295 300 
1130 

1131 Arg Lys Arg Arg Arg Gln His Gly Gln Leu Trp Phe Pro Glu Gly Phe

1132 305 310 315 320 
1133 

1134 Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 

1135 325 330 335 
1136 

1137 Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu

1138 340 345 350 

1139 

1140 Met Asp Asp Asn Gln Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys

1141 355 360 365 
1142 

1143 Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gln

1144 370 375 380 
1145 

1146 Thr Asp His Arg Gln Trp Thr Gln Gln His Leu Asp Ala Ala Asp Leu

1147 385 390 395 400 
1148 

1149 Arg Met Ser Ala Met Ala Pro Thr Pro Pro Gln Gly Glu Val Asp Ala

1150 405 410 415 
1151 

1152 Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 

1153 420 425 430 
1154 

1155 Met Ile Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu

1156 435 440 445 
1157 

1158 Glu Glu Glu Asp Ala Pro Ala Val Ile Ser Asp Phe Ile Tyr Gln Gly

1159 450 455 460 
1160 

1161 Ala Ser Leu His Asn Gln Thr Asp Arg Thr Gly Glu Thr Ala Leu His

1162 465 470 475 480 
1163 

1164 Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 

1165 485 490 495 
1166 

1167 Ala Ser Ala Asp Ala Asn Ile Gln Asp Asn Met Gly Arg Thr Pro Leu

1168 500 505 510 
1169 

1170 His Ala Ala Val Ser Ala Asp Ala Gln Gly Val Phe Gln Ile Leu Ile

1171 515 520 525 
1172 

1173 Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 



1174 530 535 540

1175 

1176 Pro Leu Ile Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp

1177 545 550 555 560 
1178 

1179 Leu Ile Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys

1180 565 570 575 
1181 

1182 Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 

1183 580 585 590 
1184 

1185 Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gln Asn Asn Arg Glu

1186 595 600 605 
1187 

1188 Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 

1189 610 615 620 
1190 

1191 Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met

1192 625 630 635 640 
1193 

1194 Asp Arg Leu Pro Arg Asp Ile Ala Gln Glu Arg Met His His Asp Ile

1195 645 650 655 
1196 

1197 Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gln Leu His

1198 660 665 670 
1199 

1200 Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 

1201 675 680 685 
1202 

1203 Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gln Gly Lys Lys

1204 690 695 700 
1205 

1206 Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 

1207 705 710 715 720 
1208 

1209 Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gln Asp Gly Lys Gly Cys

1210 725 730 735 
1211 

1212 Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 

1213 740 745 750 
1214 

1215 Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 

1216 755 760 765 
1217 

1218 Pro Phe Gln Gln Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met

1219 770 775 780 
1220 

1221 Pro Asp Thr His Leu Gly Ile Gly His Leu Asn Val Ala Ala Lys Pro

1222 785 790 795 800 
1223 

1224 Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 
1225 805 810 815

1226 

1227 Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 

1228 820 825 830 
1229 

1230 Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 

1231 835 840 845 
1232 

1233 Thr Ser Leu Asn Gly Gln Cys Glu Trp Leu Ser Arg Leu Gln Ser Gly

1234 850 855 860 
1235 

1236 Met Val Pro Asn Gln Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly

1237 865 870 875 880 
1238 

1239 Pro Leu Ser Thr Gln Ala Pro Ser Leu Gln His Gly Met Val Gly Pro

1240 885 890 895 
1241 

1242 Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gln Met Met Ser Tyr

1243 900 905 910 
1244 

1245 Gln Gly Leu Pro Ser Thr Arg Leu Ala Thr Gln Pro His Leu Val Gln

1246 915 920 925 
1247 

1248 Thr Gln Gln Val Gln Pro Gln Asn Leu Gln Met Gln Gln Gln Asn Leu

1249 930 935 940 
1250 

1251 Gln Pro Ala Asn Ile Gln Gln Gln Gln Ser Leu Gln Pro Pro Pro Pro

1252 945 950 955 960 
1253 

1254 Pro Pro Gln Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu

1255 965 970 975 
1256 

1257 Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gln Ala Asp Val Gln Pro

1258 980 985 990 
1259 

1260 Leu Gly Pro Ser Ser Leu Ala Val His Thr Ile Leu Pro Gln Glu Ser

1261 995 1000 1005 
1262 

1263 Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 

1264 1010 1015 1020 
1265 

1266 Ala Ala Gln Phe Leu Thr Pro Pro Ser Gln His Ser Tyr Ser Ser Pro

1267 1025 1030 1035 1040 
1268 

1269 Val Asp Asn Thr Pro Ser His Gln Leu Gln Val Pro Val Pro Val Met

1270 1045 1050 1055 
1271 

1272 Val Met Ile Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser Ile Leu Ile

1273 1060 1065 1070 

1274 

1275 Glu Ala Pro Asp Ser Trp

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4268 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

1291 (B) LOCATION: 2..1972
1292

1293

1294 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

1295

1296 G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 46

1297 Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys

1298 1 5 10 15
1299

1300 ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 94

1301 Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser

1302 20 25 30
1303

1304 GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 142

1305 Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp

1306 35 40 45
1307

1308 TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 190

1309 Leu Val Tyr Gln Gly Ala Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly

1310 50 55 60
1311

1312 GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 238

1313 Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala

1314 65 70 75
1315

1316 AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 286

1317 Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn Met

1318 80 85 90 95
1319

1320 GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 334

1321 Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln Gly Val

1322 100 105 110
1323

1324 TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 382

1325 Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met

1326 115 120 125
1327

1328 AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 430

1329 Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg Leu Ala Val Glu

1330 130 135 140
1331

1332 GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 478

1333 Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala Asp Val Asn Ala Val

1334 145 150 155
1335

1336 GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 526

1337 Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn

1338 160 165 170 175
1339

1340 GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 574

1341 Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met

1342 180 185 190
1343

1344 CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 622

1345 Gln Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly

1346 195 200 205
1347

1348 AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 670

1349 Ser Tyr Glu Ala Ala Lys Ile Leu Leu Asp His Phe Ala Asn Arg Asp

1350 210 215 220
1351

1352 ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 718

1353 Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg

1354 225 230 235
1355

1356 ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 766

1357 Met His His Asp Ile Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro

1358 240 245 250 255
1359

1360 AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 814

1361 Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val Ile Cys

1362 260 265 270
1363

1364 GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 862

1365 Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys

1366 275 280 285
1367

1368 AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910

1369 Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro

1370 290 295 300
1371

1372 AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958

1373 Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys

1374 305 310 315
1375

1376 TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006

1377 Ser Leu Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu Ser

1378 320 325 330 335

1379 

1380 CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 

1381 Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 

1382 340 345 350 
1383 

1384 TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 

1385 Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala Ser Pro Asn

1386 355 360 365 
1387 

1388 CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150

1389 Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gln His

1390 370 375 380 
1391 

1392 GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 

1393 Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln Pro Leu Ala His Gly

1394 385 390 395 
1395 

1396 GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
1397 Ala Ser Thr Val Leu Pro Ser Val Ser Gln Leu Leu Ser His His His
1398 400 405 410 415 

1399 

1400 ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 

1401 Ile Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His

1402 420 425 430 
1403 

1404 CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 

1405 Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 

1406 435 440 445 
1407 

1408 ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 

1409 Thr Gln Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly

1410 450 455 460 
1411 

1412 ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438

1413 Thr His Pro Gly Ile Ala Pro Gln Ser Arg Pro Pro Glu Gly Lys His

1414 465 470 475 
1415 

1416 ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 1486 

1417 Ile Thr Thr Pro Arg Glu Pro Leu Pro Pro Ile Val Thr Phe Gln Leu

1418 480 485 490 495 
1419 

1420 ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 

1421 Ile Pro Lys Gly Ser Ile Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln

1422 500 505 510 
1423 

1424 TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 

1425 Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln

1426 515 520 525 
1427 

1428 ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 1630

1429 Ile Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met

1430 530 535 540 
1431 

1432 ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678

1433 Met Pro Gln Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr

1434 545 550 555 
1435 

1436 CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 
1437 His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gln
1438 560 565 570 575 

1439 

1440 CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 1774

1441 His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 

1442 580 585 590 
1443 

1444 GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 

1445 Gly His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser

1446 595 600 605 
1447 

1448 CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 1870 

1449 Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser

1450 610 615 620 
1451 

1452 GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 

1453 Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gln Arg

1454 625 630 635 
1455 

1456 GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 

1457 Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gln Val

1458 640 645 650 655 
1459 

1460 TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 

1461 Tyr Ala 
1462 

1463 

1464 GCTGAGGAAC AAATGAAGGT CATCCGGGAG AGAAATGAAG AAATCTCTGG AGCCAGCTTC 2082 
1465 

1466 TAGAGGTAGG AAAGAGAAGA TGTTCTTATT CAGATAATGC AAGAGAAGCA ATTCGTCAGT 2142 
1467 

1468 TTCACTGGGT ATCTGCAAGG CTTATTGATT ATTCTAATCT AATAAGACAA GTTTGTGGAA 2202
1469 

1470 ATGCAAGATG AATACAAGCC TTGGGTCCAT GTTTACTCTC TTCTATTTGG AGAATAAGAT 2262 
1471 

1472 GGATGCTTAT TGAAGCCCAG ACATTCTTGC AGCTTGGACT GCATTTTAAG CCCTGCAGGC 2322
1473 

1474 TTCTGCCATA TCCATGAGAA GATTCTACAC TAGCGTCCTG TTGGGAATTA TGCCCTGGAA 2382
1475 

1476 TTCTGCCTGA ATTGACCTAC GCATCTCCTC CTCCTTGGAC ATTCTTTTGT CTTCATTTGG 2442 
1477 

1478 TGCTTTTGGT TTTGCACCTC TCCGTGATTG TAGCCCTACC AGCATGTTAT AGGGCAAGAC 2502 
1479 
1480 CTTTGTGCTT TTGATCATTC TGGCCCATGA AAGCAACTTT GGTCTCCTTT CCCCTCCTGT 2562
1481 

1482 CTTCCCGGTA TCCCTTGGAG TCTCACAAGG TTTACTTTGG TATGGTTCTC AGCACAAACC 2622 
1483 

1484 TTTCAAGTAT GTTGTTTCTT TGGAAAATGG ACATACTGTA TTGTGTTCTC CTGCATATAT 2682 
1485 

1486 CATTCCTGGA GAGAGAAGGG GAGAAGAATA CTTTTCTTCA ACAAATTTTG GGGGCAGGAG 2742
1487 

1488 ATCCCTTCAA GAGGCTGCAC CTTAATTTTT CTTGTCTGTG TGCAGGTCTT CATATAAACT 2802 
1489 

1490 TTACCAGGAA GAAGGGTGTG AGTTTGTTGT TTTTCTGTGT ATGGGCCTGG TCAGTGTAAA 2862 
1491 

1492 GTTTTATCCT TGATAGTCTA GTTACTATGA CCCTCCCCAC TTTTTTAAAA CCAGAAAAAG 2922
1493 

1494 GTTTGGAATG TTGGAATGAC CAAGAGACAA GTTAACTCGT GCAAGAGCCA GTTACCCACC 2982 
1495 

1496 CACAGGTCCC CCTACTTCCT GCCAAGCATT CCATTGACTG CCTGTATGGA ACACATTTGT 3042 
1497 

1498 CCCAGATCTG AGCATTCTAG GCCTGTTTCA CTCACTCACC CAGCATATGA AACTAGTCTT 3102 
1499 

1500 AACTGTTGAG CCTTTCCTTT CATATCCACA GAAGACACTG TCTCAAATGT TGTACCCTTG 3162 
1501 

1502 CCATTTAGGA CTGAACTTTC CTTAGCCCAA GGGACCCAGT GACAGTTGTC TTCCGTTTGT 3222 
1503 

1504 CAGATGATCA GTCTCTACTG ATTATCTTGC TGCTTAAAGG CCTGCTCACC AATCTTTCTT 3282 
1505 

1506 TCACACCGTG TGGTCCGTGT TACTGGTATA CCCAGTATGT TCTCACTGAA GACATGGACT 3342 
1507 

1508 TTATATGTTC AAGTGCAGGA ATTGGAAAGT TGGACTTGTT TTCTATGATC CAAAACAGCC 3402 
1509 

1510 CTATAAGAAG GTTGGAAAAG GAGGAACTAT ATAGCAGCCT TTGCTATTTT CTGCTACCAT 3462 
1511 

1512 TTCTTTTCCT CTGAAGCGGC CATGACATTC CCTTTGGCAA CTAACGTAGA AACTCAACAG 3522 
1513 

1514 AACATTTTCC TTTCCTAGAG TCACCTTTTA GATGATAATG GACAACTATA GACTTGCTCA 3582 
1515 

1516 TTGTTCAGAC TGATTGCCCC TCACCTGAAT CCACTCTCTG TATTCATGCT CTTGGCAATT 3642 
1517 

1518 TCTTTGACTT TCTTTTAAGG GCAGAAGCAT TTTAGTTAAT TGTAGATAAA GAATAGTTTT 3702 
1519 

1520 CTTCCTCTTC TCCTTGGGCC AGTTAATAAT TGGTCCATGG CTACACTGCA ACTTCCGTCC 3762
1521 

1522 AGTGCTGTGA TGCCCATGAC ACCTGCAAAA TAAGTTCTGC CTGGGCATTT TGTAGATATT 3822 
1523 

1524 AACAGGTGAA TTCCCGACTC TTTTGGTTTG AATGACAGTT CTCATTCCTT CTATGGCTGC 3882 
1525 

1526 AAGTATGCAT CAGTGCTTCC CACTTACCTG ATTTGTCTGT CGGTGGCCCC ATATGGAAAC 3942
1527 

1528 CCTGCGTGTC TGTTGGCATA ATAGTTTACA AATGGTTTTT TCAGTCCTAT CCAAATTTAT 4002 
1529 

1530 TGAACCAACA AAAATAATTA CTTCTGCCCT GAGATAAGCA GATTAAGTTT GTTCATTCTC 4062
1531

1532 TGCTTTATTC TCTCCATGTG GCAACATTCT GTCAGCCTCT TTCATAGTGT GCAAACATTT 4122 
1533 

1534 TATCATTCTA AATGGTGACT CTCTGCCCTT GGACCCATTT ATTATTCACA GATGGGGAGA 4182 
1535 

1536 ACCTATCTGC ATGGACCCTC ACCATCCTCT GTGCAGCACA CACAGTGCAG GGAGCCAGTG 4242 
1537 

1538 GCGATGGCGA TGACTTTCTT CCCCTG 4268 

1539 

1540 

1541 (2) INFORMATION FOR SEQ ID NO: 13: 
1542 

1543 (i) SEQUENCE CHARACTERISTICS: 



1544 (A) LENGTH: 657 amino acids 

1545 (B) TYPE: amino acid 

1546 (D) TOPOLOGY: unknown 
1547 



1548 (ii) MOLECULE TYPE: protein 

1549 

1550 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

1551 

1552 Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr 

1553 1 5 10 15
1554 

1555 Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp 

1556 20 25 30 
1557 

1558 Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu

1559 35 40 45 
1560 

1561 Val Tyr Gln Gly Ala Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly Glu

1562 50 55 60 
1563 

1564 Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys 

1565 65 70 75 80 
1566 

1567 Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn Met Gly

1568 85 90 95 
1569 

1570 Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln Gly Val Phe

1571 100 105 110 
1572 

1573 Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met Asn

1574 115 120 125 
1575 

1576 Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg Leu Ala Val Glu Gly

1577 130 135 140 
1578 

1579 Met Val Ala Glu Leu Ile Asn Cys Gln Ala Asp Val Asn Ala Val Asp

1580 145 150 155 160 
1581 
1582 Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val

1583 165 170 175 
1584 

1585 Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met Gln

1586 180 185 190 
1587 

1588 Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser 

1589 195 200 205 
1590 

1591 Tyr Glu Ala Ala Lys Ile Leu Leu Asp His Phe Ala Asn Arg Asp Ile

1592 210 215 220 
1593 

1594 Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met 

1595 225 230 235 240
1596 

1597 His His Asp Ile Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser

1598 245 250 255 
1599 

1600 Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val Ile Cys Gly

1601 260 265 270 
1602 

1603 Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys 

1604 275 280 285 
1605 

1606 Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn 

1607 290 295 300
1608 

1609 Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys Ser 

1610 305 310 315 320 
1611 

1612 Leu Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu Ser Pro

1613 325 330 335 
1614 

1615 Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr Ser 

1616 340 345 350 
1617 

1618 Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala Ser Pro Asn Pro

1619 355 360 365 
1620 

1621 Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gln His Ala

1622 370 375 380 
1623 

1624 Leu Ser Phe Ser Asn Leu His Glu Met Gln Pro Leu Ala His Gly Ala

1625 385 390 395 400 
1626 

1627 Ser Thr Val Leu Pro Ser Val Ser Gln Leu Leu Ser His His His Ile

1628 405 410 415 
1629 

1630 Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His Pro

1631 420 425 430 

1632 
1633 Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu Thr

1634 435 440 445 
1635 

1636 Gln Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly Thr

1637 450 455 460 
1638 

1639 His Pro Gly Ile Ala Pro Gln Ser Arg Pro Pro Glu Gly Lys His Ile

1640 465 470 475 480 
1641 

1642 Thr Thr Pro Arg Glu Pro Leu Pro Pro Ile Val Thr Phe Gln Leu Ile

1643 485 490 495 
1644 

1645 Pro Lys Gly Ser Ile Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser

1646 500 505 510 
1647 

1648 Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile

1649 515 520 525 
1650 

1651 Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 

1652 530 535 540 
1653 

1654 Pro Gln Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr His

1655 545 550 555 560 
1656 

1657 Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gln His

1658 565 570 575 
1659 

1660 Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser Gly 

1661 580 585 590 
1662 

1663 His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser Pro

1664 595 600 605 
1665 

1666 Asp Gln Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser Asp

1667 610 615 620 
1668 

1669 Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gln Arg Gly

1670 625 630 635 640 
1671 

1672 Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gln Val Tyr

1673 645 650 655 
1674 

1675 Ala 

1676 

1677 

1678 (2) INFORMATION FOR SEQ ID NO: 14:
1679 

1680 (i) SEQUENCE CHARACTERISTICS: 

1681 (A) LENGTH: 77 amino acids 

1682 (B) TYPE: amino acid 

1683 (C) STRANDEDNESS: single

1684 (D) TOPOLOGY: unknown

1685 

1686 (ii) MOLECULE TYPE: peptide 

1687 

1688 

1689 

1690 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

1691 

1692 Glu Asp Ile Asp Glu Cys Asp Gln Gly Ser Pro Cys Glu His Asn Gly

1693 1 5 10 15
1694 

1695 Ile Cys Val Asn Thr Pro Gly Ser Tyr Arg Cys Asn Cys Ser Gln Gly

1696 20 25 30 
1697 

1698 Phe Thr Gly Pro Arg Cys Glu Thr Asn Ile Asn Glu Cys Glu Ser His

1699 35 40 45 
1700 

1701 Pro Cys Gln Asn Glu Gly Ser Cys Leu Asp Asp Pro Gly Thr Phe Arg

1702 50 55 60 
1703 

1704 Cys Val Cys Met Pro Gly Phe Thr Gly Thr Gln Cys Glu

1705 65 70 75 
1706 

1707 (2) INFORMATION FOR SEQ ID NO:15: 
1708 

1709 (i) SEQUENCE CHARACTERISTICS: 



1710 (A) LENGTH: 78 amino acids 

1711 (B) TYPE: amino acid 

1712 (C) STRANDEDNESS: single

1713 (D) TOPOLOGY: unknown 
1714 



1715 (ii) MOLECULE TYPE: peptide 

1716 

1717 

1718 

1719 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

1720 

1721 Asn Asp Val Asp Glu Cys Ser Leu Gly Ala Asn Pro Cys Glu His Gly 

1722 1 5 10 15
1723 

1724 Gly Arg Cys Thr Asn Thr Leu Gly Ser Phe Gln Cys Asn Cys Pro Gln

1725 20 25 30 
1726 

1727 Gly Tyr Ala Gly Pro Arg Cys Glu Ile Asp Val Asn Glu Cys Leu Ser
1728 35 40 45 

1729 

1730 Asn Pro Cys Gln Asn Asp Ser Thr Cys Leu Asp Gln Ile Gly Glu Phe
1731 50 55 60 

1732 

1733 Gln Cys Ile Cys Met Pro Gly Tyr Glu Gly Leu Tyr Cys Glu

1734 65 70 75
1735
1736 

1737 (2) INFORMATION FOR SEQ ID NO: 16: 
1738 

1739 (i) SEQUENCE CHARACTERISTICS: 

1740 (A) LENGTH: 654 amino acids 

1741 (B) TYPE: amino acid 

1742 (C) STRANDEDNESS: single

1743 (D) TOPOLOGY: unknown 
1744 

1745 (ii) MOLECULE TYPE: peptide 

1746 

1747 

1748 

1749 

1750 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

1751 

1752 Thr Pro Pro Gln Gly Glu Ile Glu Ala Asp Cys Met Asp Val Asn Val

1753 1 5 10 15
1754 

1755 Arg Gly Pro Asp Gly Phe Thr Pro Leu Met Ile Ala Ser Cys Ser Gly

1756 20 25 30 
1757 

1758 Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Ser Ala 

1759 35 40 45 
1760 

1761 Asn Met Ile Ser Asp Phe Ile Gly Gln Gly Ala Gln Leu His Asn Gln

1762 50 55 60 
1763 

1764 Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ala 

1765 65 70 75 80 
1766 

1767 Arg Ala Asp Ala Ala Lys Arg Leu Leu Glu Ser Ser Ala Asp Ala Asn 

1768 85 90 95 
1769 

1770 Val Gln Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ala Ala

1771 100 105 110 
1772 

1773 Asp Ala Gln Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Ala Thr Asp

1774 115 120 125 
1775 

1776 Leu Asp Ala Arg Met Phe Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala

1777 130 135 140 
1778 

1779 Arg Leu Ala Val Glu Gly Met Val Glu Glu Leu Ile Asn Ala His Ala

1780 145 150 155 160 
1781 

1782 Asp Val Asn Ala Val Asp Glu Phe Gly Lys Ser Ala Leu His Trp Ala 

1783 165 170 175 
1784 

1785 Ala Ala Val Asn Asn Val Asp Ala Ala Ala Val Leu Leu Lys Asn Ser
1786 180 185 190

1787 

1788 Ala Asn Lys Asp Met Gln Asn Asn Lys Glu Glu Thr Ser Leu Phe Leu

1789 195 200 205 
1790 

1791 Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His 

1792 210 215 220 
1793 

1794 Tyr Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp

1795 225 230 235 240 
1796 

1797 Ile Ala Gln Glu Arg Met His His Asp Ile Val His Leu Leu Asp Glu

1798 245 250 255 
1799 

1800 Tyr Asn Leu Val Lys Ser Pro Thr Leu His Asn Gly Pro Leu Gly Ala 

1801 260 265 270 
1802 

1803 Thr Thr Leu Ser Pro Pro Ile Cys Ser Pro Asn Gly Tyr Met Gly Asn

1804 275 280 285 
1805 

1806 Met Lys Pro Ser Val Gln Ser Lys Lys Ala Arg Lys Pro Ser Ile Lys

1807 290 295 300 
1808 

1809 Gly Asn Gly Cys Lys Glu Ala Lys Glu Leu Lys Ala Arg Arg Lys Lys 

1810 305 310 315 320 
1811 

1812 Ser Gln Asp Gly Lys Thr Thr Leu Leu Asp Ser Gly Ser Ser Gly Val

1813 325 330 335 
1814 

1815 Leu Ser Pro Val Asp Ser Leu Glu Ser Thr His Gly Tyr Leu Ser Asp 

1816 340 345 350 
1817 

1818 Val Ser Ser Pro Pro Leu Met Thr Ser Pro Phe Gln Gln Ser Pro Ser

1819 355 360 365 
1820 

1821 Met Pro Leu Asn His Leu Thr Ser Met Pro Glu Ser Gln Leu Gly Met

1822 370 375 380 
1823 

1824 Asn His Ile Asn Met Ala Thr Lys Gln Glu Met Ala Ala Gly Ser Asn

1825 385 390 395 400 
1826 

1827 Arg Met Ala Phe Asp Ala Met Val Pro Arg Leu Thr His Leu Asn Ala 

1828 405 410 415 
1829 

1830 Ser Ser Pro Asn Thr Ile Met Ser Asn Gly Ser Met His Phe Thr Val

1831 420 425 430 
1832 

1833 Gly Gly Ala Pro Thr Met Asn Ser Gln Cys Asp Trp Leu Ala Arg Leu

1834 435 440 445 
1835 

1836 Gln Asn Gly Met Val Gln Asn Gln Tyr Asp Pro Ile Arg Asn Gly Ile
1837 450 455 460

1838 

1839 Gln Gln Gly Asn Ala Gln Gln Ala Gln Ala Leu Gln His Gly Leu Met

1840 465 470 475 480 
1841 

1842 Thr Ser Leu His Asn Gly Leu Pro Ala Thr Thr Leu Ser Gln Met Met

1843 485 490 495 
1844 

1845 Thr Tyr Gln Ala Met Pro Asn Thr Arg Leu Ala Asn Gln Pro His Leu

1846 500 505 510 
1847 

1848 Met Gln Ala Gln Gln Met Gln Gln Gln Gln Asn Leu Gln Leu His Gln

1849 515 520 525 
1850 

1851 Ser Met Gln Gln Gln His His Asn Ser Ser Thr Thr Ser Thr His Ile

1852 530 535 540 
1853 

1854 Asn Ser Pro Phe Cys Ser Ser Asp Ile Ser Gln Thr Asp Leu Gln Gln

1855 545 550 555 560 
1856 

1857 Met Ser Ser Asn Asn Ile His Ser Val Met Pro Gln Asp Thr Gln Ile

1858 565 570 575 
1859 

1860 Phe Ala Ala Ser Leu Pro Ser Asn Leu Thr Gln Ser Met Thr Thr Ala

1861 580 585 590 
1862 

1863 Gln Phe Leu Thr Pro Pro Ser Gln His Ser Tyr Ser Ser Pro Met Asp

1864 595 600 605 
1865 

1866 Asn Thr Pro Ser His Gln Leu Gln Val Pro Asp His Pro Phe Leu Thr

1867 610 615 620 
1868 

1869 Pro Ser Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser

1870 625 630 635 640 
1871 

1872 Asn Met Ser Asp Trp Ser Glu Gly Ile Ser Ser Pro Pro Thr

1873 645 650 
1874 

1875 (2) INFORMATION FOR SEQ ID NO: 17: 
1876 

1877 (i) SEQUENCE CHARACTERISTICS: 

1878 (A) LENGTH: 666 amino acids 

1879 (B) TYPE: amino acid 

1880 (C) STRANDEDNESS: single

1881 (D) TOPOLOGY: unknown 
1882 

1883 (ii) MOLECULE TYPE: peptide 

1884 

1885 

1886 

1887 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
1888

1889 Thr Pro Pro Gln Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val

1890 1 5 10 15
1891 

1892 Arg Gly Pro Asp Gly Phe Thr Pro Leu Met Ile Ala Ser Cys Ser Gly

1893 20 25 30 
1894 

1895 Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 

1896 35 40 45 
1897 

1898 Val Ile Ser Asp Phe Ile Tyr Gln Gly Ala Ser Leu His Asn Gln Thr

1899 50 55 60 
1900 

1901 Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 

1902 65 70 75 80 
1903 

1904 Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn Ile

1905 85 90 95 
1906 

1907 Gln Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp

1908 100 105 110 
1909 

1910 Ala Gln Gly Val Phe Gln Ile Leu Leu Arg Asn Arg Ala Thr Asp Leu

1911 115 120 125 
1912 

1913 Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg

1914 130 135 140 
1915 

1916 Leu Ala Val Glu Gly Met Leu Glu Asp Leu Ile Asn Ser His Ala Asp

1917 145 150 155 160 
1918 

1919 Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 

1920 165 170 175 
1921 

1922 Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 

1923 180 185 190 
1924 

1925 Asn Lys Asp Met Gln Asn Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala

1926 195 200 205 
1927 

1928 Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 

1929 210 215 220 
1930 

1931 Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp Ile

1932 225 230 235 240 
1933 

1934 Ala Gln Glu Arg Met His His Asp Ile Val Arg Leu Leu Asp Glu Tyr

1935 245 250 255 
1936 

1937 Asn Leu Val Arg Ser Pro Gln Leu His Gly Thr Ala Leu Gly Gly Thr

1938 260 265 270
1939

1940 Pro Thr Leu Ser Pro Thr Leu Cys Ser Pro Asn Gly Tyr Leu Gly Asn 

1941 275 280 285 
1942 

1943 Leu Lys Ser Ala Thr Gln Gly Lys Lys Ala Arg Lys Pro Ser Thr Lys

1944 290 295 300 
1945 

1946 Gly Leu Ala Cys Ser Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 

1947 305 310 315 320 
1948 

1949 Lys Lys Ser Gln Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Ser Met

1950 325 330 335 
1951 

1952 Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 

1953 340 345 350 
1954 

1955 Val Ala Ser Pro Pro Leu Pro Ser Pro Phe Gln Gln Ser Pro Ser Met

1956 355 360 365 
1957 

1958 Pro Leu Ser His Leu Pro Gly Met Pro Asp Thr His Leu Gly Ile Ser

1959 370 375 380 
1960 

1961 His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Ala Gly Gly 

1962 385 390 395 400 
1963 

1964 Ser Arg Leu Ala Phe Glu Pro Pro Pro Pro Arg Leu Ser His Leu Pro 

1965 405 410 415 
1966 

1967 Val Ala Ser Ser Ala Ser Thr Val Leu Ser Thr Asn Gly Thr Gly Ala 

1968 420 425 430 
1969 

1970 Met Asn Phe Thr Val Gly Ala Pro Ala Ser Leu Asn Gly Gln Cys Glu

1971 435 440 445 
1972 

1973 Trp Leu Pro Arg Leu Gln Asn Gly Met Val Pro Ser Gln Tyr Asn Pro

1974 450 455 460 
1975 

1976 Leu Arg Pro Gly Val Thr Pro Gly Thr Leu Ser Thr Gln Ala Ala Gly

1977 465 470 475 480 
1978 

1979 Leu Gln His Gly Met Met Ser Pro Ile His Ser Ser Leu Ser Thr Asn

1980 485 490 495 
1981 

1982 Thr Leu Ser Pro Ile Ile Tyr Gln Gly Leu Pro Asn Thr Arg Leu Ala

1983 500 505 510 
1984 

1985 Thr Gln Pro His Leu Val Gln Thr Gln Gln Val Gln Pro Gln Asn Leu

1986 515 520 525 
1987 

1988 Gln Ile Gln Pro Gln Asn Leu Gln Pro Pro Ser Gln Pro His Leu Ser

1989 530 535 540
1990

1991 Val Ser Ser Ala Ala Asn Gly His Leu Gly Arg Ser Phe Leu Ser Gly 

1992 545 550 555 560 
1993 

1994 Glu Pro Ser Gln Ala Asp Val Gln Pro Leu Gly Pro Ser Ser Leu Pro

1995 565 570 575 
1996 

1997 Val His Thr Ile Leu Pro Gln Glu Ser Gln Ala Leu Pro Thr Ser Leu

1998 580 585 590 
1999 

2000 Pro Ser Ser Met Val Pro Pro Met Thr Thr Thr Gln Phe Leu Thr Pro

2001 595 600 605 
2002 

2003 Pro Ser Gln His Ser Tyr Ser Ser Ser Pro Val Asp Asn Thr Pro Ser

2004 610 615 620 
2005 

2006 His Gln Leu Gln Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu

2007 625 630 635 640 
2008 

2009 Ser Pro Asp Gln Trp Ser Ser Ser Ser Arg His Ser Asn Ile Ser Asp

2010 645 650 655 

2011 

2012 Trp Ser Glu Gly Ile Ser Ser Pro Pro Thr

2013 660 665 

2014 

2015 (2) INFORMATION FOR SEQ ID NO: 18: 
2016 

2017 (i) SEQUENCE CHARACTERISTICS: 



2018 (A) LENGTH: 681 amino acids 

2019 (B) TYPE: amino acid 

2020 (C) STRANDEDNESS: single

2021 (D) TOPOLOGY: unknown 
2022 



2023 (ii) MOLECULE TYPE: peptide 

2024 

2025 

2026 

2027 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

2028 

2029 Thr Pro Pro Gln Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val

2030 1 5 10 15

2031 

2032 Arg Gly Pro Asp Gly Phe Thr Pro Leu Met Ile Ala Ser Cys Ser Gly

2033 20 25 30 
2034 

2035 Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 

2036 35 40 45 
2037 

2038 Val Ile Ser Asp Phe Ile Tyr Gln Gly Ala Ser Leu His Asn Gln Thr

2039 50 55 60 
2040 



2041 Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg

2042 65 70 75 80 
2043 

2044 Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn Ile

2045 85 90 95 
2046 

2047 Gln Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp

2048 100 105 110 
2049 

2050 Ala Gln Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Ala Thr Asp Leu

2051 115 120 125 
2052 

2053 Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg

2054 130 135 140 

2055 

2056 Leu Ala Val Glu Gly Met Leu Glu Asp Leu Ile Asn Ser His Ala Asp

2057 145 150 155 160 
2058 

2059 Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 

2060 165 170 175 
2061 

2062 Ala Val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 

2063 180 185 190 
2064 

2065 Asn Lys Asp Met Gln Asn Asn Arg Glu Glu Thr Pro Leu Phe Leu Ala

2066 195 200 205 
2067 

2068 Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 

2069 210 215 220 
2070 

2071 Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp Ile

2072 225 230 235 240 

2073 

2074 Ala Gln Glu Arg Met His His Asp Ile Val Arg Leu Leu Asp Glu Tyr

2075 245 250 255 
2076 

2077 Asn Leu Val Arg Ser Pro Gln Leu His Gly Ala Pro Leu Gly Gly Thr

2078 260 265 270 
2079 

2080 Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn Gly Tyr Leu Gly Ser 

2081 275 280 285 
2082 

2083 Leu Lys Pro Gly Val Gln Gly Lys Lys Val Arg Lys Pro Ser Ser Lys

2084 290 295 300 
2085 

2086 Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 

2087 305 310 315 320 
2088 

2089 Lys Lys Ser Gln Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Gly Met

2090 325 330 335 
2091

2092 Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp

2093 340 345 350
2094

2095 Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe Gln Gln Ser Pro Ser

2096 355 360 365
2097

2098 Val Pro Leu Asn His Leu Pro Gly Met Pro Asp Thr His Leu Gly Ile

2099 370 375 380
2100

2101 Gly His Leu Asn Val Ala Ala Lys Pro Glu Met Ala Ala Leu Gly Gly

2102 385 390 395 400
2103

2104 Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro Arg Leu Ser His Leu

2105 405 410 415
2106

2107 Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly Ser Ser Ser Gly Gly

2108 420 425 430
2109

2110 Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser Leu Asn Gly Gln Cys

2111 435 440 445
2112

2113 Glu Trp Leu Ser Arg Leu Gln Ser Gly Met Val Pro Asn Gln Tyr Asn

2114 450 455 460
2115

2116 Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu Ser Thr Gln Ala Pro

2117 465 470 475 480
2118

2119 Ser Leu Gln His Gly Met Val Gly Pro Leu His Ser Ser Leu Ala Ala

2120 485 490 495
2121

2122 Ser Ala Leu Ser Gln Met Met Ser Tyr Gln Gly Leu Pro Ser Thr Arg

2123 500 505 510
2124

2125 Leu Ala Thr Gln Pro His Leu Val Gln Thr Gln Gln Val Gln Pro Gln

2126 515 520 525
2127

2128 Asn Leu Gln Met Gln Gln Gln Asn Leu Gln Pro Ala Asn Ile Gln Gln

2129 530 535 540
2130

2131 Gln Gln Ser Leu Gln Pro Pro Pro Pro Pro Pro Gln Pro His Leu Gly

2132 545 550 555 560
2133

2134 Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg Ser Phe Leu Ser Gly

2135 565 570 575
2136

2137 Glu Pro Ser Gln Ala Asp Val Gln Pro Leu Gly Pro Ser Ser Leu Ala

2138 580 585 590
2139

2140 Val His Thr Ile Leu Pro Gln Glu Ser Pro Ala Leu Pro Thr Ser Leu

2141 595 600 605
2142

2143 Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala Gln Phe Leu Thr Pro

2144 610 615 620 
2145 

2146 Pro Ser Gln His Ser Tyr Ser Ser Pro Val Glu Asn Thr Pro Ser His

2147 625 630 635 640 
2148 

2149 Gln Leu Gln Val Pro Glu His Pro Phe Leu Thr Pro Ser Pro Glu Ser

2150 645 650 655 
2151 

2152 Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser Asn Val Ser Asp Trp

2153 660 665 670 
2154 

2155 Ser Glu Gly Val Ser Ser Pro Pro Thr 

2156 675 680 
2157 

2158 (2) INFORMATION FOR SEQ ID NO: 19: 
2159 

2160 (i) SEQUENCE CHARACTERISTICS: 



2161 (A) LENGTH: 2471 amino acids 

2162 (B) TYPE: amino acid 

2163 (C) STRANDEDNESS: single

2164 (D) TOPOLOGY: unknown 
2165 



2166 (ii) MOLECULE TYPE: peptide 

2167 

2168 

2169 

2170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

2171 

2172 Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp 

2173 1 5 10 15
2174 

2175 Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr

2176 20 25 30 
2177 

2178 Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr 

2179 35 40 45 
2180 

2181 Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His

2182 50 55 60 
2183 

2184 Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val

2185 65 70 75 80 
2186 

2187 Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe

2188 85 90 95 
2189 

2190 Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser

2191 100 105 110 
2192 

2193 Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr
2194 115 120 125

2195 

2196 Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp

2197 130 135 140 
2198 

2199 Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr 

2200 145 150 155 160 
2201 

2202 Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly

2203 165 170 175 
2204 

2205 Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys

2206 180 185 190 
2207 

2208 Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln

2209 195 200 205 
2210 

2211 Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro

2212 210 215 220 
2213 

2214 Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly

2215 225 230 235 240 
2216 

2217 Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr 

2218 245 250 255 
2219 

2220 Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly

2221 260 265 270 
2222 

2223 Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro 

2224 275 280 285 
2225 

2226 Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu

2227 290 295 300 
2228 

2229 Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly

2230 305 310 315 320 

2231 

2232 Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser 

2233 325 330 335 
2234 

2235 Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr

2236 340 345 350 
2237 

2238 Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys

2239 355 360 365 
2240 

2241 Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys

2242 370 375 380 
2243 

2244 His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile
2245 385 390 395 400

2246 

2247 Cys Thr Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val

2248 405 410 415 
2249 

2250 Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys 

2251 420 425 430 
2252 

2253 Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr 

2254 435 440 445 
2255 

2256 Ala Gly Pro Arg Cys Glu Met Asp Ile Asn Glu Cys His Ser Asp Pro

2257 450 455 460 
2258 

2259 Cys Gln Asn Asp Ala Thr Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys

2260 465 470 475 480 
2261 

2262 Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu Ile Asn

2263 485 490 495 
2264 

2265 Glu Cys Gln Ser Asn Pro Cys Val Asn Asn Gly Gln Cys Val Asp Lys

2266 500 505 510 
2267 

2268 Val Asn Arg Phe Gln Cys Leu Cys Pro Pro Gly Phe Thr Gly Pro Val

2269 515 520 525 
2270 

2271 Cys Gln Ile Asp Ile Asp Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly

2272 530 535 540 
2273 

2274 Ala Lys Cys Ile Asp His Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr

2275 545 550 555 560 

2276 

2277 Gly Phe Thr Gly Val Leu Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro

2278 565 570 575 
2279 

2280 Asp Pro Cys His His Gly Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr

2281 580 585 590 
2282 

2283 Cys Ile Cys Asn Pro Gly Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile

2284 595 600 605 
2285 

2286 Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp

2287 610 615 620 
2288 

2289 Leu Val Asn Gly Tyr Gln Cys Asn Cys Gln Pro Gly Thr Ser Gly Val

2290 625 630 635 640 

2291 

2292 Asn Cys Glu Ile Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys Ile His

2293 645 650 655 

2294 

2295 Gly Ile Cys Met Asp Gly Ile Asn Arg Tyr Ser Cys Val Cys Ser Pro
2296 660 665 670

2297 

2298 Gly Phe Thr Gly Gln Arg Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser

2299 675 680 685 
2300 

2301 Asn Pro Cys Arg Lys Gly Ala Thr Cys Ile Asn Gly Val Asn Gly Phe

2302 690 695 700 
2303 

2304 Arg Cys Ile Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gln

2305 705 710 715 720 
2306 

2307 Val Asn Glu Cys Leu Ser Asn Pro Cys Ile His Gly Asn Cys Thr Gly

2308 725 730 735 

2309 

2310 Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly Ile

2311 740 745 750 
2312 

2313 Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn

2314 755 760 765 
2315 

2316 Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys 

2317 770 775 780 
2318 

2319 Lys Gly Phe Lys Gly Tyr Asn Cys Gln Val Asn Ile Asp Glu Cys Ala

2320 785 790 795 800 
2321 

2322 Ser Asn Pro Cys Leu Asn Gln Gly Thr Cys Phe Asp Asp Ile Ser Gly

2323 805 810 815 
2324 

2325 Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys Gln Thr

2326 820 825 830 
2327 

2328 Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys 

2329 835 840 845 
2330 

2331 Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly 

2332 850 855 860 
2333 

2334 Trp Gln Gly Gln Arg Cys Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys

2335 865 870 875 880 
2336 

2337 Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gln Gly Ser Tyr Met

2338 885 890 895 

2339 

2340 Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp Ile

2341 900 905 910 
2342 

2343 Asp Asp Cys Leu Ala Asn Pro Cys Gln Asn Gly Gly Ser Cys Met Asp

2344 915 920 925 
2345 

2346 Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp
2347 930 935 940

2348 

2349 Lys Cys Gln Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn

2350 945 950 955 960 
2351 

2352 Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln

2353 965 970 975 

2354 

2355 Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn Ile Asn Glu Cys Thr

2356 980 985 990 
2357 

2358 Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser

2359 995 1000 1005 
2360 

2361 Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu His 

2362 1010 1015 1020 
2363 

2364 Glu Ile Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly Thr Cys

2365 1025 1030 1035 1040 
2366 

2367 Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly Tyr Thr

2368 1045 1050 1055 

2369 

2370 Gly Lys Asn Cys Gln Thr Leu Val Asn Leu Cys Ser Arg Ser Pro Cys

2371 1060 1065 1070 

2372 

2373 Lys Asn Lys Gly Thr Cys Val Gln Lys Lys Ala Glu Ser Gln Cys Leu

2374 1075 1080 1085 

2375 

2376 Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys Asp Val Pro Asn Val Ser

2377 1090 1095 1100 

2378 

2379 Cys Asp Ile Ala Ala Ser Arg Arg Gly Val Leu Val Glu His Leu Cys

2380 1105 1110 1115 1120 

2381 

2382 Gln His Ser Gly Val Cys Ile Asn Ala Gly Asn Thr His Tyr Cys Gln

2383 1125 1130 1135 

2384 

2385 Cys Pro Leu Gly Tyr Thr Gly Ser Tyr Cys Glu Glu Gln Leu Asp Glu

2386 1140 1145 1150 

2387 

2388 Cys Ala Ser Asn Pro Cys Gln His Gly Ala Thr Cys Ser Asp Phe Ile

2389 1155 1160 1165 
2390 

2391 Gly Gly Tyr Arg Cys Glu Cys Val Pro Gly Tyr Gln Gly Val Asn Cys

2392 1170 1175 1180 

2393 

2394 Glu Tyr Glu Val Asp Glu Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly

2395 1185 1190 1195 1200 

2396 

2397 Thr Cys Ile Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro Pro Gly
2398 1205 1210 1215

2399 

2400 Thr Arg Gly Leu Leu Cys Glu Glu Asn Ile Asp Asp Cys Ala Arg Gly

2401 1220 1225 1230 
2402 

2403 Pro His Cys Leu Asn Gly Gly Gln Cys Met Asp Arg Ile Gly Gly Tyr

2404 1235 1240 1245 
2405 

2406 Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly Asp 

2407 1250 1255 1260 
2408 

2409 Ile Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser Leu Asp

2410 1265 1270 1275 1280 
2411 

2412 Cys Ile Gln Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg Ser Ala Phe

2413 1285 1290 1295 
2414 

2415 Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys Pro Gln Met Pro

2416 1300 1305 1310 
2417 

2418 Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser Asn Met Pro Asp Gly 

2419 1315 1320 1325 
2420 

2421 Phe Ile Cys Arg Cys Pro Pro Gly Phe Ser Gly Ala Arg Cys Gln Ser

2422 1330 1335 1340 
2423 

2424 Ser Cys Gly Gln Val Lys Cys Arg Lys Gly Glu Gln Cys Val His Thr

2425 1345 1350 1355 1360 
2426 

2427 Ala Ser Gly Pro Arg Cys Phe Cys Pro Ser Pro Arg Asp Cys Glu Ser
2428 1365 1370 1375 
2429 

2430 Gly Cys Ala Ser Ser Pro Cys Gln His Gly Gly Ser Cys His Pro Gln

2431 1380 1385 1390 
2432 

2433 Arg Gln Pro Pro Tyr Tyr Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly

2434 1395 1400 1405 
2435 

2436 Ser Arg Cys Glu Leu Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr 

2437 1410 1415 1420 
2438 

2439 Cys Leu Ser Gln Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp
2440 1425 1430 1435 1440 
2441 

2442 Glu Ala Cys Asn Ser His Ala Cys Gln Trp Asp Gly Gly Asp Cys Ser

2443 1445 1450 1455 
2444 

2445 Leu Thr Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys 

2446 1460 1465 1470 
2447 

2448 Trp Asp Tyr Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu
2449 1475 1480 1485

2450 

2451 Cys Leu Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys Lys

2452 1490 1495 1500 

2453 

2454 Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys Asn Gln

2455 1505 1510 1515 1520 

2456 

2457 Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala Ala 

2458 1525 1530 1535 
2459 

2460 Asp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val Ile Val Val Leu

2461 1540 1545 1550 
2462 

2463 Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg Ser Phe Leu Arg Ala

2464 1555 1560 1565 
2465 

2466 Leu Gly Thr Leu Leu His Thr Asn Leu Arg Ile Lys Arg Asp Ser Gln

2467 1570 1575 1580 
2468 

2469 Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu Lys Ser Ala Ala Met 

2470 1585 1590 1595 1600 
2471 

2472 Lys Lys Gln Arg Met Thr Arg Arg Ser Leu Pro Gly Glu Gln Glu Gln

2473 1605 1610 1615 

2474 

2475 Glu Val Ala Gly Ser Lys Val Phe Leu Glu Ile Asp Asn Arg Gln Cys

2476 1620 1625 1630 

2477 

2478 Val Gln Asp Ser Asp His Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala

2479 1635 1640 1645 

2480 

2481 Leu Leu Ala Ser His Ala Ile Gln Gly Thr Leu Ser Tyr Pro Leu Val

2482 1650 1655 1660 
2483 

2484 Ser Val Val Ser Glu Ser Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr

2485 1665 1670 1675 1680 

2486 

2487 Leu Leu Ala Val Ala Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly

2488 1685 1690 1695 
2489 

2490 Val Ile Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro

2491 1700 1705 1710 

2492 

2493 Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu

2494 1715 1720 1725 

2495 

2496 Pro Val Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln Val

2497 1730 1735 1740 

2498 

2499 Ser Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp Val Asp
2500 1745 1750 1755 1760

2501 

2502 Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp Glu Ala Leu

2503 1765 1770 1775 

2504 

2505 Leu Ser Glu Glu Asp Asp Pro Ile Asp Arg Arg Pro Trp Thr Gln Gln

2506 1780 1785 1790 
2507 

2508 His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro Ser Leu Ala Leu Thr

2509 1795 1800 1805 
2510 

2511 Pro Pro Gln Ala Glu Gln Glu Val Asp Val Leu Asp Val Asn Val Arg

2512 1810 1815 1820 

2513 

2514 Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly

2515 1825 1830 1835 1840 

2516 

2517 Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala

2518 1845 1850 1855 

2519 

2520 Asn Ile Ile Thr Asp Leu Val Tyr Gln Gly Ala Ser Leu Gln Ala Gln

2521 1860 1865 1870 

2522 

2523 Thr Asp Arg Thr Gly Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser

2524 1875 1880 1885 

2525 

2526 Arg Ala Asp Ala Ala Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn

2527 1890 1895 1900 

2528 

2529 Ala Gln Asp Asn Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala

2530 1905 1910 1915 1920 

2531 

2532 Asp Ala Gln Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp

2533 1925 1930 1935 

2534 

2535 Leu Asp Ala Arg Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala

2536 1940 1945 1950 
2537 

2538 Arg Leu Ala Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala

2539 1955 1960 1965 

2540 

2541 Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp Ala

2542 1970 1975 1980 

2543 

2544 Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly

2545 1985 1990 1995 2000 

2546 

2547 Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu

2548 2005 2010 2015 

2549 

2550 Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile Leu Leu Asp His
2551 2020 2025 2030

2552 

2553 Phe Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg Leu Pro Arg Asp

2554 2035 2040 2045 
2555 

2556 Val Ala Arg Asp Arg Met His His Asp Ile Val Arg Leu Leu Asp Glu

2557 2050 2055 2060 
2558 

2559 Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu 

2560 2065 2070 2075 2080 
2561 

2562 Ser Pro Val Ile Cys Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His

2563 2085 2090 2095 
2564 

2565 Thr Pro Met Gly Lys Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met 

2566 2100 2105 2110 
2567 

2568 Pro Thr Ser Leu Pro Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly 

2569 2115 2120 2125 
2570 

2571 Ser Arg Arg Lys Lys Ser Leu Ser Glu Lys Val Gln Leu Ser Glu Ser

2572 2130 2135 2140 

2573 

2574 Ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr 

2575 2145 2150 2155 2160 
2576 

2577 Val Ser Asp Thr Thr Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu

2578 2165 2170 2175 
2579 

2580 Gln Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro

2581 2180 2185 2190 
2582 

2583 Val His Ala Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln

2584 2195 2200 2205 
2585 

2586 Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln Leu

2587 2210 2215 2220 
2588 

2589 Leu Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala Gly Ser

2590 2225 2230 2235 2240 
2591 

2592 Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg 

2593 2245 2250 2255 
2594 

2595 Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe Gly Met Val Leu

2596 2260 2265 2270 
2597 

2598 Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala Pro Gln Ser Arg Pro

2599 2275 2280 2285 

2600 

2601 Pro Glu Gly Lys His Ile Thr Thr Pro Arg Glu Pro Leu Pro Pro Ile

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2556 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro Ala

2652 1 5 10 15

2653 

2654 Leu Ala Ala Arg Gly Pro Arg Cys Ser Gln Pro Gly Glu Thr Cys Leu

2655 20 25 30 
2656 

2657 Asn Gly Gly Lys Cys Glu Ala Ala Asn Gly Thr Glu Ala Cys Val Cys 

2658 35 40 45 
2659 

2660 Gly Gly Ala Phe Val Gly Pro Arg Cys Gln Asp Pro Asn Pro Cys Leu

2661 50 55 60 
2662 

2663 Ser Thr Pro Cys Lys Asn Ala Gly Thr Cys His Val Val Asp Arg Arg 

2664 65 70 75 80 
2665 

2666 Gly Val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly Pro 

2667 85 90 95 
2668 

2669 Leu Cys Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys Arg 

2670 100 105 110 
2671 

2672 Asn Gly Gly Thr Cys Asp Leu Leu Thr Leu Thr Glu Tyr Lys Cys Arg 

2673 115 120 125 
2674 

2675 Cys Pro Pro Gly Trp Ser Gly Lys Ser Cys Gln Gln Ala Asp Pro Cys

2676 130 135 140 
2677 

2678 Ala Ser Asn Pro Cys Ala Asn Gly Gly Gln Cys Leu Pro Phe Glu Ala

2679 145 150 155 160 
2680 

2681 Ser Tyr Ile Cys His Cys Pro Pro Ser Phe His Gly Pro Thr Cys Arg

2682 165 170 175 
2683 

2684 Gln Asp Val Asn Glu Cys Gly Gln Lys Pro Arg Leu Cys Arg His Gly

2685 180 185 190 
2686 

2687 Gly Thr Cys His Asn Glu Val Gly Ser Tyr Arg Cys Val Cys Arg Ala 

2688 195 200 205 
2689 

2690 Thr His Thr Gly Pro Asn Cys Glu Arg Pro Tyr Val Pro Cys Ser Pro 

2691 210 215 220 
2692 

2693 Ser Pro Cys Gln Asn Gly Gly Thr Cys Arg Pro Thr Gly Asp Val Thr

2694 225 230 235 240 
2695 

26 96 His Glu Cys Ala Cys Leu Pro Gly Phe Thr Gly Gin Asn Cys Glu Glu 

2697 245 250 255 

2698 

2699 Asn lie Asp Asp Cys Pro Gly Asn Asn Cys Lys Asn Gly Gly Ala Cys 

2700 260 265 270 
2701 

2702 Val Asp Gly Val Asn Thr Tyr Asn Cys Pro Cys Pro Pro Glu Trp Thr 

2703 275 280 285 






2704
2705 Gly Gln Tyr Cys Thr Glu Asp Val Asp Glu Cys Gln Leu Met Pro Asn
2706 290 295 300
2707
2708 Ala Cys Gln Asn Gly Gly Thr Cys His Asn Thr His Gly Gly Tyr Asn
2709 305 310 315 320
2710
2711 Cys Val Cys Val Asn Gly Trp Thr Gly Glu Asp Cys Ser Glu Asn Ile
2712 325 330 335
2713
2714 Asp Asp Cys Ala Ser Ala Ala Cys Phe His Gly Ala Thr Cys His Asp
2715 340 345 350
2716
2717 Arg Val Ala Ser Phe Tyr Cys Glu Cys Pro His Gly Arg Thr Gly Leu
2718 355 360 365
2719
2720 Leu Cys His Leu Asn Asp Ala Cys Ile Ser Asn Pro Cys Asn Glu Gly
2721 370 375 380
2722
2723 Ser Asn Cys Asp Thr Asn Pro Val Asn Gly Lys Ala Ile Cys Thr Cys
2724 385 390 395 400
2725
2726 Pro Ser Gly Tyr Thr Gly Pro Ala Cys Ser Gln Asp Val Asp Glu Cys
2727 405 410 415
2728
2729 Ser Leu Gly Ala Asn Pro Cys Glu His Ala Gly Lys Cys Ile Asn Thr
2730 420 425 430
2731
2732 Leu Gly Ser Phe Glu Cys Gln Cys Leu Gln Gly Tyr Thr Gly Pro Arg
2733 435 440 445
2734
2735 Cys Glu Ile Asp Val Asn Glu Cys Val Ser Asn Pro Cys Gln Asn Asp
2736 450 455 460
2737
2738 Ala Thr Cys Leu Asp Gln Ile Gly Glu Phe Gln Cys Met Cys Met Pro
2739 465 470 475 480
2740
2741 Gly Tyr Glu Gly Val His Cys Glu Val Asn Thr Asp Glu Cys Ala Ser
2742 485 490 495
2743
2744 Ser Pro Cys Leu His Asn Gly Arg Cys Leu Asp Lys Ile Asn Glu Phe
2745 500 505 510
2746
2747 Gln Cys Glu Cys Pro Thr Gly Phe Thr Gly His Leu Cys Gln Tyr Asp
2748 515 520 525
2749
2750 Val Asp Glu Cys Ala Ser Thr Pro Cys Lys Asn Gly Ala Lys Cys Leu
2751 530 535 540
2752
2753 Asp Gly Pro Asn Thr Tyr Thr Cys Val Cys Thr Glu Gly Tyr Thr Gly
2754 545 550 555 560



Thr His Cys Glu Val Asp Ile Asp Glu Cys Asp Pro Asp Pro Cys His
565 570 575

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys Arg
580 585 590

Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn Ile Asn Glu Cys Ser
595 600 605

Ser Gln Pro Cys Arg Leu Arg Gly Thr Cys Gln Asp Pro Asp Asn Ala
610 615 620

Tyr Leu Cys Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu Ile
625 630 635 640

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys Leu
645 650 655

Asp Lys Ile Asp Gly Tyr Glu Cys Ala Cys Glu Pro Gly Tyr Thr Gly
660 665 670

Ser Met Cys Asn Ser Asn Ile Asp Glu Cys Ala Gly Asn Pro Cys His
675 680 685

Asn Gly Gly Thr Cys Glu Asp Gly Ile Asn Gly Phe Thr Cys Arg Cys
690 695 700

Pro Glu Gly Tyr His Asp Pro Thr Cys Leu Ser Glu Val Asn Glu Cys
705 710 715 720

Asn Ser Asn Pro Cys Val His Gly Ala Cys Arg Asp Ser Leu Asn Gly
725 730 735

Tyr Lys Cys Asp Cys Asp Pro Gly Trp Ser Gly Thr Asn Cys Asp Ile
740 745 750

Asn Asn Asn Glu Cys Glu Ser Asn Pro Cys Val Asn Gly Gly Thr Cys
755 760 765

Lys Asp Met Thr Ser Gly Ile Val Cys Thr Cys Arg Glu Gly Phe Ser
770 775 780

Gly Pro Asn Cys Gln Thr Asn Ile Asn Glu Cys Ala Ser Asn Pro Cys
785 790 795 800

Leu Asn Lys Gly Thr Cys Ile Asp Asp Val Ala Gly Tyr Lys Cys Asn
805 810 815

Cys Leu Leu Pro Tyr Thr Gly Ala Thr Cys Glu Val Val Leu Ala Pro
820 825 830



Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gln Ser Glu
835 840 845

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Ala Gly Ala Lys Gly
850 855 860

Gln Thr Cys Glu Val Asp Ile Asn Glu Cys Val Leu Ser Pro Cys Arg
865 870 875 880

His Gly Ala Ser Cys Gln Asn Thr His Gly Gly Tyr Arg Cys His Cys
885 890 895

Gln Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp Ile Asp Asp Cys
900 905 910

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly Ile Asn
915 920 925

Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys Glu
930 935 940

Glu Asp Ile Asn Glu Cys Ala Ser Asp Pro Cys Arg Asn Gly Ala Asn
945 950 955 960

Cys Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly Phe
965 970 975

Ser Gly Ile His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser Ser
980 985 990

Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser Phe Thr Cys
995 1000 1005

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gln His Val Val Asn
1010 1015 1020

Glu Cys Asp Ser Arg Pro Cys Leu Leu Gly Gly Thr Cys Gln Asp Gly
1025 1030 1035 1040

Arg Gly Leu His Arg Cys Thr Cys Pro Gln Gly Tyr Thr Gly Pro Asn
1045 1050 1055

Cys Gln Asn Leu Val His Trp Cys Asp Ser Ser Pro Cys Lys Asn Gly
1060 1065 1070

Gly Lys Cys Trp Gln Thr His Thr Gln Tyr Arg Cys Glu Cys Pro Ser
1075 1080 1085

Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro Ser Val Ser Cys Glu Val
1090 1095 1100



2857
2858 Ala Ala Gln Arg Gln Gly Val Asp Val Ala Arg Leu Cys Gln His Gly
2859 1105 1110 1115 1120
2860
2861 Gly Leu Cys Val Asp Ala Gly Asn Thr His His Cys Arg Cys Gln Ala
2862 1125 1130 1135
2863
2864 Gly Tyr Thr Gly Ser Tyr Cys Glu Asp Leu Val Asp Glu Cys Ser Pro
2865 1140 1145 1150
2866
2867 Ser Pro Cys Gln Asn Gly Ala Thr Cys Thr Asp Tyr Leu Gly Gly Tyr
2868 1155 1160 1165
2869
2870 Ser Cys Lys Cys Val Ala Gly Tyr His Gly Val Asn Cys Ser Glu Glu
2871 1170 1175 1180
2872
2873 Ile Asp Glu Cys Leu Ser His Pro Cys Gln Asn Gly Gly Thr Cys Leu
2874 1185 1190 1195 1200
2875
2876 Asp Leu Pro Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gln Gly
2877 1205 1210 1215
2878
2879 Val His Cys Glu Ile Asn Val Asp Asp Cys Asn Pro Pro Val Asp Pro
2880 1220 1225 1230
2881
2882 Val Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp Gln
2883 1235 1240 1245
2884
2885 Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu Arg
2886 1250 1255 1260
2887
2888 Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp Ala Arg
2889 1265 1270 1275 1280
2890
2891 Gly Thr Gln Asn Cys Val Gln Arg Val Asn Asp Phe His Cys Glu Cys
2892 1285 1290 1295
2893
2894 Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val Ile Asn Gly Cys
2895 1300 1305 1310
2896
2897 Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys Ala Val Ala Ser Asn
2898 1315 1320 1325
2899
2900 Thr Ala Arg Gly Phe Ile Cys Lys Cys Pro Ala Gly Phe Glu Gly Ala
2901 1330 1335 1340
2902
2903 Thr Cys Glu Asn Asp Ala Arg Thr Cys Gly Ser Leu Arg Cys Leu Asn
2904 1345 1350 1355 1360
2905
2906 Gly Gly Thr Cys Ile Ser Gly Pro Arg Ser Pro Thr Cys Leu Cys Leu
2907 1365 1370 1375



2908
2909 Gly Pro Phe Thr Gly Pro Glu Cys Gln Phe Pro Ala Ser Ser Pro Cys
2910 1380 1385 1390
2911
2912 Leu Gly Gly Asn Pro Cys Tyr Asn Gln Gly Thr Cys Glu Pro Thr Ser
2913 1395 1400 1405
2914
2915 Glu Ser Pro Phe Tyr Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly Leu
2916 1410 1415 1420
2917
2918 Leu Cys His Ile Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg Asp
2919 1425 1430 1435 1440
2920
2921 Ile Pro Pro Pro Leu Ile Glu Glu Ala Cys Glu Leu Pro Glu Cys Gln
2922 1445 1450 1455
2923
2924 Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gln Cys Asn Asn His Ala
2925 1460 1465 1470
2926
2927 Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro Trp
2928 1475 1480 1485
2929
2930 Lys Asn Cys Thr Gln Ser Leu Gln Cys Trp Lys Tyr Phe Ser Asp Gly
2931 1490 1495 1500
2932
2933 His Cys Asp Ser Gln Cys Asn Ser Ala Gly Cys Leu Phe Asp Gly Phe
2934 1505 1510 1515 1520
2935
2936 Asp Cys Gln Arg Ala Glu Gly Gln Cys Asn Pro Leu Tyr Asp Gln Tyr
2937 1525 1530 1535
2938
2939 Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gln Gly Cys Asn Ser
2940 1540 1545 1550
2941
2942 Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val Pro Glu
2943 1555 1560 1565
2944
2945 Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro Pro Glu
2946 1570 1575 1580
2947
2948 Gln Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser Arg Val
2949 1585 1590 1595 1600
2950
2951 Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gln Gln Met
2952 1605 1610 1615
2953
2954 Ile Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His Pro Ile
2955 1620 1625 1630
2956
2957 Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu Gly Gln
2958 1635 1640 1645



2959
2960 Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg Arg
2961 1650 1655 1660
2962
2963 Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser Ile Val Tyr Leu Glu
2964 1665 1670 1675 1680
2965
2966 Ile Asp Asn Arg Gln Cys Val Gln Ala Ser Ser Gln Cys Phe Gln Ser
2967 1685 1690 1695
2968
2969 Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly Ser
2970 1700 1705 1710
2971
2972 Leu Asn Ile Pro Tyr Lys Ile Glu Ala Val Gln Ser Glu Thr Val Glu
2973 1715 1720 1725
2974
2975 Pro Pro Pro Pro Ala Gln Leu His Phe Met Tyr Val Ala Ala Ala Ala
2976 1730 1735 1740
2977
2978 Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser Arg Lys
2979 1745 1750 1755 1760
2980
2981 Arg Arg Arg Gln His Gly Gln Leu Trp Phe Pro Glu Gly Phe Lys Val
2982 1765 1770 1775
2983
2984 Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Glu Leu Gly Glu Asp Ser
2985 1780 1785 1790
2986
2987 Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu Met Asp
2988 1795 1800 1805
2989
2990 Asp Asn Gln Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys Lys Phe
2991 1810 1815 1820
2992
2993 Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gln Thr Asp
2994 1825 1830 1835 1840
2995
2996 His Arg Gln Trp Thr Gln Gln His Leu Asp Ala Ala Asp Leu Arg Met
2997 1845 1850 1855
2998
2999 Ser Ala Met Ala Pro Thr Pro Pro Gln Gly Glu Val Asp Ala Asp Cys
3000 1860 1865 1870
3001
3002 Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu Met Ile
3003 1875 1880 1885
3004
3005 Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu
3006 1890 1895 1900
3007
3008 Glu Asp Ala Pro Ala Val Ile Ser Asp Phe Ile Tyr Gln Gly Ala Ser
3009 1905 1910 1915 1920



Leu His Asn Gln Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala
1925 1930 1935

Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser
1940 1945 1950

Ala Asp Ala Asn Ile Gln Asp Asn Met Gly Arg Thr Pro Leu His Ala
1955 1960 1965

Ala Val Ser Ala Asp Ala Gln Gly Val Phe Gln Ile Leu Ile Arg Asn
1970 1975 1980

Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu
1985 1990 1995 2000

Ile Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp Leu Ile
2005 2010 2015

Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala
2020 2025 2030

Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val Val Leu
2035 2040 2045

Leu Lys Asn Gly Ala Asn Lys Asp Met Gln Asn Asn Arg Glu Glu Thr
2050 2055 2060

Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val
2065 2070 2075 2080

Leu Leu Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg
2085 2090 2095

Leu Pro Arg Asp Ile Ala Gln Glu Arg Met His His Asp Ile Val Arg
2100 2105 2110

Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gln Leu His Gly Ala
2115 2120 2125

Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro Asn
2130 2135 2140

Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gln Gly Lys Lys Val Arg
2145 2150 2155 2160

Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp
2165 2170 2175

Leu Lys Ala Arg Arg Lys Lys Ser Gln Asp Gly Lys Gly Cys Leu Leu
2180 2185 2190



Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His
2195 2200 2205

Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro Phe
2210 2215 2220

Gln Gln Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met Pro Asp
2225 2230 2235 2240

Thr His Leu Gly Ile Gly His Leu Asn Val Ala Ala Lys Pro Glu Met
2245 2250 2255

Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro Pro
2260 2265 2270

Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val Leu Gly
2275 2280 2285

Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser Thr Ser
2290 2295 2300

Leu Asn Gly Gln Cys Glu Trp Leu Ser Arg Leu Gln Ser Gly Met Val
2305 2310 2315 2320

Pro Asn Gln Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly Pro Leu
2325 2330 2335

Ser Thr Gln Ala Pro Ser Leu Gln His Gly Met Val Gly Pro Leu His
2340 2345 2350

Ser Ser Leu Ala Ala Ser Ala Leu Ser Gln Met Met Ser Tyr Gln Gly
2355 2360 2365

Leu Pro Ser Thr Arg Leu Ala Thr Gln Pro His Leu Val Gln Thr Gln
2370 2375 2380

Gln Val Gln Pro Gln Asn Leu Gln Met Gln Gln Gln Asn Leu Gln Pro
2385 2390 2395 2400

Ala Asn Ile Gln Gln Gln Gln Ser Leu Gln Pro Pro Pro Pro Pro Pro
2405 2410 2415

Gln Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly Arg
2420 2425 2430

Ser Phe Leu Ser Gly Glu Pro Ser Gln Ala Asp Val Gln Pro Leu Gly
2435 2440 2445

Pro Ser Ser Leu Ala Val His Thr Ile Leu Pro Gln Glu Ser Pro Ala
2450 2455 2460

3112
3113 Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala Ala
3114 2465 2470 2475 2480
3115
3116 Gln Phe Leu Thr Pro Pro Ser Gln His Ser Tyr Ser Ser Pro Val Glu
3117 2485 2490 2495
3118
3119 Asn Thr Pro Ser His Gln Leu Gln Val Pro Glu His Pro Phe Leu Thr
3120 2500 2505 2510
3121
3122 Pro Ser Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser
3123 2515 2520 2525
3124
3125 Asn Val Ser Asp Trp Ser Glu Gly Val Ser Ser Pro Pro Thr Ser Met
3126 2530 2535 2540
3127
3128 Gln Ser Gln Ile Ala Arg Ile Pro Glu Ala Phe Lys
3129 2545 2550 2555
3130 

3131 (2) INFORMATION FOR SEQ ID NO:21:
3132
3133 (i) SEQUENCE CHARACTERISTICS:
3134 (A) LENGTH: 9723 base pairs
3135 (B) TYPE: nucleic acid
3136 (C) STRANDEDNESS: double
3137 (D) TOPOLOGY: unknown
3138
3139 (ii) MOLECULE TYPE: cDNA
3140
3141
3142 (ix) FEATURE:
3143 (A) NAME/KEY: CDS
3144 (B) LOCATION: 10..7419
3145
3146
3147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

3148 

3149 GGAATTCCG CCC GCC CTG CGC CCC GCT CTG CTG TGG GCG CTG CTG GCG 48
3150 Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala
3151 1 5 10
3152
3153 CTC TGG CTG TGC TGC GCG GCC CCC GCG CAT GCA TTG CAG TGT CGA GAT 96
3154 Leu Trp Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp
3155 15 20 25
3156
3157 GGC TAT GAA CCC TGT GTA AAT GAA GGA ATG TGT GTT ACC TAC CAC AAT 144
3158 Gly Tyr Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn
3159 30 35 40 45
3160
3161 GGC ACA GGA TAC TGC AAA TGT CCA GAA GGC TTC TTG GGG GAA TAT TGT 192
3162 Gly Thr Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys
3163 50 55 60
3164
3165 CAA CAT CGA GAC CCC TGT GAG AAG AAC CGC TGC CAG AAT GGT GGG ACT 240
3166 Gln His Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr
3167 65 70 75
3168
3169 TGT GTG GCC CAG GCC ATG CTG GGG AAA GCC ACG TGC CGA TGT GCC TCA 288
3170 Cys Val Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser
3171 80 85 90
3172
3173 GGG TTT ACA GGA GAG GAC TGC CAG TAC TCA ACA TCT CAT CCA TGC TTT 336
3174 Gly Phe Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe
3175 95 100 105
3176
3177 GTG TCT CGA CCC TGC CTG AAT GGC GGC ACA TGC CAT ATG CTC AGC CGG 384
3178 Val Ser Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg
3179 110 115 120 125
3180
3181 GAT ACC TAT GAG TGC ACC TGT CAA GTC GGG TTT ACA GGT AAG GAG TGC 432
3182 Asp Thr Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys
3183 130 135 140
3184
3185 CAA TGG ACG GAT GCC TGC CTG TCT CAT CCC TGT GCA AAT GGA AGT ACC 480
3186 Gln Trp Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr
3187 145 150 155
3188
3189 TGT ACC ACT GTG GCC AAC CAG TTC TCC TGC AAA TGC CTC ACA GGC TTC 528
3190 Cys Thr Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe
3191 160 165 170
3192
3193 ACA GGG CAG AAA TGT GAG ACT GAT GTC AAT GAG TGT GAC ATT CCA GGA 576
3194 Thr Gly Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly
3195 175 180 185
3196
3197 CAC TGC CAG CAT GGT GGC ACC TGC CTC AAC CTG CCT GGT TCC TAC CAG 624
3198 His Cys Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln
3199 190 195 200 205
3200
3201 TGC CAG TGC CCT CAG GGC TTC ACA GGC CAG TAC TGT GAC AGC CTG TAT 672
3202 Cys Gln Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr
3203 210 215 220
3204
3205 GTG CCC TGT GCA CCC TCA CCT TGT GTC AAT GGA GGC ACC TGT CGG CAG 720
3206 Val Pro Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln
3207 225 230 235
3208
3209 ACT GGT GAC TTC ACT TTT GAG TGC AAC TGC CTT CCA GGT TTT GAA GGG 768
3210 Thr Gly Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly
3211 240 245 250
3212
3213 AGC ACC TGT GAG AGG AAT ATT GAT GAC TGC CCT AAC CAC AGG TGT CAG 816
3214 Ser Thr Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln

3215 255 260 265
3216
3217 AAT GGA GGG GTT TGT GTG GAT GGG GTC AAC ACT TAC AAC TGC CGC TGT 864
3218 Asn Gly Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys
3219 270 275 280 285
3220
3221 CCC CCA CAA TGG ACA GGA CAG TTC TGC ACA GAG GAT GTG GAT GAA TGC 912
3222 Pro Pro Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys
3223 290 295 300
3224
3225 CTG CTG CAG CCC AAT GCC TGT CAA AAT GGG GGC ACC TGT GCC AAC CGC 960
3226 Leu Leu Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg
3227 305 310 315
3228
3229 AAT GGA GGC TAT GGC TGT GTA TGT GTC AAC GGC TGG AGT GGA GAT GAC 1008
3230 Asn Gly Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp
3231 320 325 330
3232
3233 TGC AGT GAG AAC ATT GAT GAT TGT GCC TTC GCC TCC TGT ACT CCA GGC 1056
3234 Cys Ser Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly
3235 335 340 345
3236
3237 TCC ACC TGC ATC GAC CGT GTG GCC TCC TTC TCT TGC ATG TGC CCA GAG 1104
3238 Ser Thr Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu
3239 350 355 360 365
3240
3241 GGG AAG GCA GGT CTC CTG TGT CAT CTG GAT GAT GCA TGC ATC AGC AAT 1152
3242 Gly Lys Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn
3243 370 375 380
3244
3245 CCT TGC CAC AAG GGG GCA CTG TGT GAC ACC AAC CCC CTA AAT GGG CAA 1200
3246 Pro Cys His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln
3247 385 390 395
3248
3249 TAT ATT TGC ACC TGC CCA CAA GGC TAC AAA GGG GCT GAC TGC ACA GAA 1248
3250 Tyr Ile Cys Thr Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu
3251 400 405 410
3252
3253 GAT GTG GAT GAA TGT GCC ATG GCC AAT AGC AAT CCT TGT GAG CAT GCA 1296
3254 Asp Val Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala
3255 415 420 425
3256
3257 GGA AAA TGT GTG AAC ACG GAT GGC GCC TTC CAC TGT GAG TGT CTG AAG 1344
3258 Gly Lys Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys
3259 430 435 440 445
3260
3261 GGT TAT GCA GGA CCT CGT TGT GAG ATG GAC ATC AAT GAG TGC CAT TCA 1392
3262 Gly Tyr Ala Gly Pro Arg Cys Glu Met Asp Ile Asn Glu Cys His Ser
3263 450 455 460
3264 
3265 GAC CCC TGC CAG AAT GAT GCT ACC TGT CTG GAT AAG ATT GGA GGC TTC 1440
3266 Asp Pro Cys Gln Asn Asp Ala Thr Cys Leu Asp Lys Ile Gly Gly Phe
3267 465 470 475
3268
3269 ACA TGT CTG TGC ATG CCA GGT TTC AAA GGT GTG CAT TGT GAA TTA GAA 1488
3270 Thr Cys Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu
3271 480 485 490
3272
3273 ATA AAT GAA TGT CAG AGC AAC CCT TGT GTG AAC AAT GGG CAG TGT GTG 1536
3274 Ile Asn Glu Cys Gln Ser Asn Pro Cys Val Asn Asn Gly Gln Cys Val
3275 495 500 505
3276
3277 GAT AAA GTC AAT CGT TTC CAG TGC CTG TGT CCT CCT GGT TTC ACT GGG 1584
3278 Asp Lys Val Asn Arg Phe Gln Cys Leu Cys Pro Pro Gly Phe Thr Gly
3279 510 515 520 525
3280
3281 CCA GTT TGC CAG ATT GAT ATT GAT GAC TGT TCC AGT ACT CCG TGT CTG 1632
3282 Pro Val Cys Gln Ile Asp Ile Asp Asp Cys Ser Ser Thr Pro Cys Leu
3283 530 535 540
3284
3285 AAT GGG GCA AAG TGT ATC GAT CAC CCG AAT GGC TAT GAA TGC CAG TGT 1680
3286 Asn Gly Ala Lys Cys Ile Asp His Pro Asn Gly Tyr Glu Cys Gln Cys
3287 545 550 555
3288
3289 GCC ACA GGT TTC ACT GGT GTG TTG TGT GAG GAG AAC ATT GAC AAC TGT 1728
3290 Ala Thr Gly Phe Thr Gly Val Leu Cys Glu Glu Asn Ile Asp Asn Cys
3291 560 565 570
3292
3293 GAC CCC GAT CCT TGC CAC CAT GGT CAG TGT CAG GAT GGT ATT GAT TCC 1776
3294 Asp Pro Asp Pro Cys His His Gly Gln Cys Gln Asp Gly Ile Asp Ser
3295 575 580 585
3296
3297 TAC ACC TGC ATC TGC AAT CCC GGG TAC ATG GGC GCC ATC TGC AGT GAC 1824
3298 Tyr Thr Cys Ile Cys Asn Pro Gly Tyr Met Gly Ala Ile Cys Ser Asp
3299 590 595 600 605
3300
3301 CAG ATT GAT GAA TGT TAC AGC AGC CCT TGC CTG AAC GAT GGT CGC TGC 1872
3302 Gln Ile Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys
3303 610 615 620
3304
3305 ATT GAC CTG GTC AAT GGC TAC CAG TGC AAC TGC CAG CCA GGC ACG TCA 1920
3306 Ile Asp Leu Val Asn Gly Tyr Gln Cys Asn Cys Gln Pro Gly Thr Ser
3307 625 630 635
3308
3309 GGG GTT AAT TGT GAA ATT AAT TTT GAT GAC TGT GCA AGT AAC CCT TGT 1968
3310 Gly Val Asn Cys Glu Ile Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys
3311 640 645 650
3312
3313 ATC CAT GGA ATC TGT ATG GAT GGC ATT AAT CGC TAC AGT TGT GTC TGC 2016
3314 Ile His Gly Ile Cys Met Asp Gly Ile Asn Arg Tyr Ser Cys Val Cys
3315 655 660 665
3316

3317 TCA CCA GGA TTC ACA GGG CAG AGA TGT AAC ATT GAC ATT GAT GAG TGT 2064
3318 Ser Pro Gly Phe Thr Gly Gln Arg Cys Asn Ile Asp Ile Asp Glu Cys
3319 670 675 680 685
3320
3321 GCC TCC AAT CCC TGT CGC AAG GGT GCA ACA TGT ATC AAC GGT GTG AAT 2112
3322 Ala Ser Asn Pro Cys Arg Lys Gly Ala Thr Cys Ile Asn Gly Val Asn
3323 690 695 700
3324
3325 GGT TTC CGC TGT ATA TGC CCC GAG GGA CCC CAT CAC CCC AGC TGC TAC 2160
3326 Gly Phe Arg Cys Ile Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr
3327 705 710 715
3328
3329 TCA CAG GTG AAC GAA TGC CTG AGC AAT CCC TGC ATC CAT GGA AAC TGT 2208
3330 Ser Gln Val Asn Glu Cys Leu Ser Asn Pro Cys Ile His Gly Asn Cys
3331 720 725 730
3332
3333 ACT GGA GGT CTC AGT GGA TAT AAG TGT CTC TGT GAT GCA GGC TGG GTT 2256
3334 Thr Gly Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val
3335 735 740 745
3336
3337 GGC ATC AAC TGT GAA GTG GAC AAA AAT GAA TGC CTT TCG AAT CCA TGC 2304
3338 Gly Ile Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys
3339 750 755 760 765
3340
3341 CAG AAT GGA GGA ACT TGT GAC AAT CTG GTG AAT GGA TAC AGG TGT ACT 2352
3342 Gln Asn Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr
3343 770 775 780
3344
3345 TGC AAG AAG GGC TTT AAA GGC TAT AAC TGC CAG GTG AAT ATT GAT GAA 2400
3346 Cys Lys Lys Gly Phe Lys Gly Tyr Asn Cys Gln Val Asn Ile Asp Glu
3347 785 790 795
3348
3349 TGT GCC TCA AAT CCA TGC CTG AAC CAA GGA ACC TGC TTT GAT GAC ATA 2448
3350 Cys Ala Ser Asn Pro Cys Leu Asn Gln Gly Thr Cys Phe Asp Asp Ile
3351 800 805 810
3352
3353 AGT GGC TAC ACT TGC CAC TGT GTG CTG CCA TAC ACA GGC AAG AAT TGT 2496
3354 Ser Gly Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys
3355 815 820 825
3356
3357 CAG ACA GTA TTG GCT CCC TGT TCC CCA AAC CCT TGT GAG AAT GCT GCT 2544
3358 Gln Thr Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala
3359 830 835 840 845
3360
3361 GTT TGC AAA GAG TCA CCA AAT TTT GAG AGT TAT ACT TGC TTG TGT GCT 2592
3362 Val Cys Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala
3363 850 855 860
3364
3365 CCT GGC TGG CAA GGT CAG CGG TGT ACC ATT GAC ATT GAC GAG TGT ATC 2640
3366 Pro Gly Trp Gln Gly Gln Arg Cys Thr Ile Asp Ile Asp Glu Cys Ile
3367 865 870 875
3368
3369 TCC AAG CCC TGC ATG AAC CAT GGT CTC TGC CAT AAC ACC CAG GGC AGC 2688
3370 Ser Lys Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gln Gly Ser
3371 880 885 890
3372
3373 TAC ATG TGT GAA TGT CCA CCA GGC TTC AGT GGT ATG GAC TGT GAG GAG 2736
3374 Tyr Met Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu
3375 895 900 905
3376
3377 GAC ATT GAT GAC TGC CTT GCC AAT CCT TGC CAG AAT GGA GGT TCC TGT 2784
3378 Asp Ile Asp Asp Cys Leu Ala Asn Pro Cys Gln Asn Gly Gly Ser Cys
3379 910 915 920 925
3380
3381 ATG GAT GGA GTG AAT ACT TTC TCC TGC CTC TGC CTT CCG GGT TTC ACT 2832
3382 Met Asp Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr
3383 930 935 940
3384
3385 GGG GAT AAG TGC CAG ACA GAC ATG AAT GAG TGT CTG AGT GAA CCC TGT 2880
3386 Gly Asp Lys Cys Gln Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys
3387 945 950 955
3388
3389 AAG AAT GGA GGG ACC TGC TCT GAC TAC GTC AAC AGT TAC ACT TGC AAG 2928
3390 Lys Asn Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys
3391 960 965 970
3392
3393 TGC CAG GCA GGA TTT GAT GGA GTC CAT TGT GAG AAC AAC ATC AAT GAG 2976
3394 Cys Gln Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn Ile Asn Glu
3395 975 980 985
3396
3397 TGC ACT GAG AGC TCC TGT TTC AAT GGT GGC ACA TGT GTT GAT GGG ATT 3024
3398 Cys Thr Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile
3399 990 995 1000 1005
3400
3401 AAC TCC TTC TCT TGC TTG TGC CCT GTG GGT TTC ACT GGA TCC TTC TGC 3072
3402 Asn Ser Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys
3403 1010 1015 1020
3404
3405 CTC CAT GAG ATC AAT GAA TGC AGC TCT CAT CCA TGC CTG AAT GAG GGA 3120
3406 Leu His Glu Ile Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly
3407 1025 1030 1035
3408
3409 ACG TGT GTT GAT GGC CTG GGT ACC TAC CGC TGC AGC TGC CCC CTG GGC 3168
3410 Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu Gly
3411 1040 1045 1050
3412
3413 TAC ACT GGG AAA AAC TGT CAG ACC CTG GTG AAT CTC TGC AGT CGG TCT 3216
3414 Tyr Thr Gly Lys Asn Cys Gln Thr Leu Val Asn Leu Cys Ser Arg Ser
3415 1055 1060 1065
3416
3417 CCA TGT AAA AAC AAA GGT ACT TGT GTT CAG AAA AAA GCA GAG TCC CAG 3264
His Gly Ala Ser Thr Val Leu Pro Ser Val Ser
3703 2210 2215 2220
3704
3705 CAG TTG CTA TCC CAC CAC CAC ATT GTG TCT CCA GGC AGT GGC AGT GCT 6720
3706 Gln Leu Leu Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala
3707 2225 2230 2235
3708
3709 GGA AGC TTG AGT AGG CTC CAT CCA GTC CCA GTC CCA GCA GAT TGG ATG 6768
3710 Gly Ser Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met
3711 2240 2245 2250
3712
3713 AAC CGC ATG GAG GTG AAT GAG ACC CAG TAC AAT GAG ATG TTT GGT ATG 6816
3714 Asn Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe Gly Met
3715 2255 2260 2265
3716
3717 GTC CTG GCT CCA GCT GAG GGC ACC CAT CCT GGC ATA GCT CCC CAG AGC 6864
3718 Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala Pro Gln Ser
3719 2270 2275 2280 2285
3720
3721 AGG CCA CCT GAA GGG AAG CAC ATA ACC ACC CCT CGG GAG CCC TTG CCC 6912
3722 Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg Glu Pro Leu Pro
3723 2290 2295 2300
3724
3725 [...] ATT GTG ACT TTC CAG CTC ATC CCT AAA GGC AGT ATT GCC CAA CCA 6960
3726 Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly Ser Ile Ala Gln Pro
3727 2305 2310 2315
3728


3729 GCG GGG GCT CCC CAG CCT CAG TCC ACC TGC CCT CCA GCT GTT GCG GGC 7008
3730 Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys Pro Pro Ala Val Ala Gly
3731 2320 2325 2330
3732
3733 CCC CTG CCC ACC ATG TAC CAG ATT CCA GAA ATG GCC CGT TTG CCC AGT 7056
3734 Pro Leu Pro Thr Met Tyr Gln Ile Pro Glu Met Ala Arg Leu Pro Ser
3735 2335 2340 2345
3736


3737 GTG GCT TTC CCC ACT GCC ATG ATG CCC CAG CAG GAC GGG CAG GTA GCT 7104
3738 Val Ala Phe Pro Thr Ala Met Met Pro Gln Gln Asp Gly Gln Val Ala
3739 2350 2355 2360 2365
3740
3741 CAG ACC ATT CTC CCA GCC TAT CAT CCT TTC CCA GCC TCT GTG GGC AAG 7152
3742 Gln Thr Ile Leu Pro Ala Tyr His Pro Phe Pro Ala Ser Val Gly Lys
3743 2370 2375 2380
3744
3745 TAC CCC ACA CCC CCT TCA CAG CAC AGT TAT GCT TCC TCA AAT GCT GCT 7200
3746 Tyr Pro Thr Pro Pro Ser Gln His Ser Tyr Ala Ser Ser Asn Ala Ala
3747 2385 2390 2395
3748
3749 GAG CGA ACA CCC AGT CAC AGT GGT CAC CTC CAG GGT GAG CAT CCC TAC 7248
3750 Glu Arg Thr Pro Ser His Ser Gly His Leu Gln Gly Glu His Pro Tyr
3751 2400 2405 2410
3752
3753 CTG ACA CCA TCC CCA GAG TCT CCT GAC CAG TGG TCA AGT TCA TCA CCC 7296
3754 Leu Thr Pro Ser Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro
3755 2415 2420 2425
3756
3757 CAC TCT GCT TCT GAC TGG TCA GAT GTG ACC ACC AGC CCT ACC CCT GGG 7344
3758 His Ser Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly
3759 2430 2435 2440 2445
3760
3761 GGT GCT GGA GGA GGT CAG CGG GGA CCT GGG ACA CAC ATG TCT GAG CCA 7392
3762 Gly Ala Gly Gly Gly Gln Arg Gly Pro Gly Thr His Met Ser Glu Pro
3763 2450 2455 2460
3764

3765 CCA CAC AAC AAC ATG CAG GTT TAT GCG TGAGAGAGTC CACCTCCAGT 7439
3766 Pro His Asn Asn Met Gln Val Tyr Ala
3767 2465 2470
3768

3769 GTAGAGACAT AACTGACTTT TGTAAATGCT GCTGAGGAAC AAATGAAGGT CATCCGGGAG 7499
3770

3771 AGAAATGAAG AAATCTCTGG AGCCAGCTTC TAGAGGTAGG AAAGAGAAGA TGTTCTTATT 7559
3772

3773 CAGATAATGC AAGAGAAGCA ATTCGTCAGT TTCACTGGGT ATCTGCAAGG CTTATTGATT 7619
3774

3775 ATTCTAATCT AATAAGACAA GTTTGTGGAA ATGCAAGATG AATACAAGCC TTGGGTCCAT 7679
3776 

3777 GTTTACTCTC TTCTATTTGG AGAATAAGAT GGATGCTTAT TGAAGCCCAG ACATTCTTGC 7739 
3778 

3779 AGCTTGGACT GCATTTTAAG CCCTGCAGGC TTCTGCCATA TCCATGAGAA GATTCTACAC 7799 
3780 

3781 TAGCGTCCTG TTGGGAATTA TGCCCTGGAA TTCTGCCTGA ATTGACCTAC GCATCTCCTC 7859
3782 

3783 CTCCTTGGAC ATTCTTTTGT CTTCATTTGG TGCTTTTGGT TTTGCACCTC TCCGTGATTG 7919 
3784 

3785 TAGCCCTACC AGCATGTTAT AGGGCAAGAC CTTTGTGCTT TTGATCATTC TGGCCCATGA 7979 
3786 

3787 AAGCAACTTT GGTCTCCTTT CCCCTCCTGT CTTCCCGGTA TCCCTTGGAG TCTCACAAGG 8039
3788 

3789 TTTACTTTGG TATGGTTCTC AGCACAAACC TTTCAAGTAT GTTGTTTCTT TGGAAAATGG 8099 
3790 

3791 ACATACTGTA TTGTGTTCTC CTGCATATAT CATTCCTGGA GAGAGAAGGG GAGAAGAATA 8159 
3792 

3793 CTTTTCTTCA ACAAATTTTG GGGGCAGGAG ATCCCTTCAA GAGGCTGCAC CTTAATTTTT 8219 
3794 

3795 CTTGTCTGTG TGCAGGTCTT CATATAAACT TTACCAGGAA GAAGGGTGTG AGTTTGTTGT 8279
3796 

3797 TTTTCTGTGT ATGGGCCTGG TCAGTGTAAA GTTTTATCCT TGATAGTCTA GTTACTATGA 8339 
3798 

3799 CCCTCCCCAC TTTTTTAAAA CCAGAAAAAG GTTTGGAATG TTGGAATGAC CAAGAGACAA 8399 
3800 

3801 GTTAACTCGT GCAAGAGCCA GTTACCCACC CACAGGTCCC CCTACTTCCT GCCAAGCATT 8459 
3802 

3803 CCATTGACTG CCTGTATGGA ACACATTTGT CCCAGATCTG AGCATTCTAG GCCTGTTTCA 8519 
3804 

3805 CTCACTCACC CAGCATATGA AACTAGTCTT AACTGTTGAG CCTTTCCTTT CATATCCACA 8579 
3806 

3807 GAAGACACTG TCTCAAATGT TGTACCCTTG CCATTTAGGA CTGAACTTTC CTTAGCCCAA 8639
3808 

3809 GGGACCCAGT GACAGTTGTC TTCCGTTTGT CAGATGATCA GTCTCTACTG ATTATCTTGC 8699 
3810 

3811 TGCTTAAAGG CCTGCTCACC AATCTTTCTT TCACACCGTG TGGTCCGTGT TACTGGTATA 8759 
3812 

3813 CCCAGTATGT TCTCACTGAA GACATGGACT TTATATGTTC AAGTGCAGGA ATTGGAAAGT 8819 
3814 

3815 TGGACTTGTT TTCTATGATC CAAAACAGCC CTATAAGAAG GTTGGAAAAG GAGGAACTAT 8879 
3816 

3817 ATAGCAGCCT TTGCTATTTT CTGCTACCAT TTCTTTTCCT CTGAAGCGGC CATGACATTC 8939
3818 

3819 CCTTTGGCAA CTAACGTAGA AACTCAACAG AACATTTTCC TTTCCTAGAG TCACCTTTTA 8999 
3820 

3821 GATGATAATG GACAACTATA GACTTGCTCA TTGTTCAGAC TGATTGCCCC TCACCTGAAT 9059 
3822 

3823 CCACTCTCTG TATTCATGCT CTTGGCAATT TCTTTGACTT TCTTTTAAGG GCAGAAGCAT 9119
3824 

3825 TTTAGTTAAT TGTAGATAAA GAATAGTTTT CTTCCTCTTC TCCTTGGGCC AGTTAATAAT 9179 
3826

3827 TGGTCCATGG CTACACTGCA ACTTCCGTCC AGTGCTGTGA TGCCCATGAC ACCTGCAAAA 9239 
3828 

3829 TAAGTTCTGC CTGGGCATTT TGTAGATATT AACAGGTGAA TTCCCGACTC TTTTGGTTTG 9299 
3830 

3831 AATGACAGTT CTCATTCCTT CTATGGCTGC AAGTATGCAT CAGTGCTTCC CACTTACCTG 9359
3832 

3833 ATTTGTCTGT CGGTGGCCCC ATATGGAAAC CCTGCGTGTC TGTTGGCATA ATAGTTTACA 9419 
3834 

3835 AATGGTTTTT TCAGTCCTAT CCAAATTTAT TGAACCAACA AAAATAATTA CTTCTGCCCT 9479 
3836 

3837 GAGATAAGCA GATTAAGTTT GTTCATTCTC TGCTTTATTC TCTCCATGTG GCAACATTCT 9539 
3838 

3839 GTCAGCCTCT TTCATAGTGT GCAAACATTT TATCATTCTA AATGGTGACT CTCTGCCCTT 9599 
3840 

3841 GGACCCATTT ATTATTCACA GATGGGGAGA ACCTATCTGC ATGGACCCTC ACCATCCTCT 9659 
3842 

3843 GTGCAGCACA CACAGTGCAG GGAGCCAGTG GCGATGGCGA TGACTTTCTT CCCCTGGGAA 9719 
3844 

3845 TTCC 9723 

3846 

3847 